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Improved Protein Expression in 
Bacillus subtills 



FIELD OF THE INVENTION 

The present invention relates to cells that have been genetically manipulated to 
have an altered capacity to produce expressed proteins. In particular, the present invention 
relates to Gram-positive microorganisms having exogenous nucleic acid sequences 
introduced therein and methods for producing proteins in such host cells, such as members 
of the genus Bacillus. More specifically, the present invention relates to the expression, 
production and secretion of a polypeptide of interest and to cells that have been genetically 
manipulated to have an altered capacity to produce expressed proteins. In particular, the 
present invention provides for the enhanced expression of a selected polypeptide by a 
microorganism. 

BACKGROUND OF THE INVENTION 

Gram-positive microorganisms, such as members of the genus Bacillus, have been 
used for large-scale industrial fermentation due, in part, to their ability to secrete their 
fermentation products into their culture media. Secreted proteins are exported across a cell 
membrane and a cell wall, and then are subsequently released into the external media. 

Indeed, secretion of heterologous polypeptides is a widely used technique in 
industry. Typically, cells are transformed with a nucleic acid encoding a heterologous 
polypeptide of interest to be expressed and thereby produce large iquantities of desired 
polypeptides. This technique can be used to produce a vast amount of polypeptide over 
what would be produced naturally. These expressed polypeptides have a number of 
industrial applications, including therapeutic and agricultural uses, as well as use in foods, 
cosmetics, cleaning compositions, animal feed, etc. Thus, increasing expression of 
polypeptides is of great interest in many fields. 

The Gram-positive eubacterium Bacillus subtilis has acquired the ability to survive in 
a complex and continuously changing environment. To address such changes in its 
environment, B. subtilis has developed complex signal transduction systems to sense a 
wide variety of extracellular stimuli. During its life cycle, B. subtilis uses two-component 
regulatory systems, each consisting of a sensor kinase and a response regulator (mostly a 
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Rev., 55:395-424 [1991]), synthesis of peptide antibiotics (Marahiel etal. t Mol. Microbiol., 
7:631-636 [1993]), production of secreted proteolytic enzymes (Kunst et a/., Res. Microbiol., 
145: 393-402 [1994]), and sporulation (Grossman, Ann. Rev. Genet., 29:477-508 [1995]). 
Upon phosphorylation of a response regulator by its cognate sensor kinase, transcription of 
a specific subset of genes is induced (Fabret et a/., J. BacterioL, 181:1975-1983 [1999]; and 
Jiang et a/., Mol. Microbiol., 38:535-542 [2000]). Fine-tuning of these regulatory systems 
can be governed by intracellular response regulators aspartyl phosphatases (Raps), and 
their antagonistic phosphatase regulators (Phrs) that serve as cell-density signalling 
molecules (Perego et a/., Mol. Microbiol., 19:1151-1170 [1996]; and Perego, Cell 76:1047- 
1055 [1998]). The Rap phosphatases have the ability to dephosphorylate their cognate 
response regulators, thereby temporally "overruling" the action of sensor kinases (Gray, 
Trends Microbiol., 5:184-188 [1997]). 

Most B. subtilis 168 rap genes are transcribed in operons with genes that encode their 
antagonistic Phr peptides (Kunst et a/., Nature 390:249-56 [1997]). While the Rap 
phosphatases remain in the cytoplasm, Phr peptides contain an amino-terminal signal peptide 
and have the potential to be exported as pro-peptides, most likely via the Sec pathway 
(Perego et a/., [1996] supra; Tjalsma et a/., Mol. Biol. Rev., 64:515-547 [2000]). Further 
possible extracellular processing may result in active Phr pentapeptides with a weakly 
conserved XRXXT sequence. These pentapeptides are thought to be re-imported via the 
oligopeptide permease (Opp) system, and specifically inhibit the activity of their cognate Rap 
phosphatase (Solomon et a/., Genes Dev., 10:2014-2024 [1996]; Perego, Proc. Natl. Acad. 
Sci. USA 94:8612-8617 [1997]; and Perego [1998] supra). However, four of the eleven known 
rap genes are not followed by functional phr genes, suggesting that these may not be involved 
in quorum-sensing processes. Interestingly, rap-phr operons were also found on several 
endogenous rolling-circle plasmids from 6. subtilis (Meijer et a/., FEMS Microbiol. Lett., 
37:283-288 [1998]). 

The chromosomally-encoded Rap-Phr systems orchestrate the sequential 
programmes of global gene transcription in a growing culture of S. subtilis cells. During 
exponential growth (low cell-densities), RapC dephosphorylates ComA~P, thereby inhibiting 
competence development. At the end of the exponential growth phase, the activity of RapC is 
antagonized by the accumulation of the PhrC peptide, and competence development is 
stimulated (Lazazzera et a/., Cell, 89: 917-925 [1997]; and Lazazzera etai % J. BacterioL, 181: 
5193-5200 [1999]). Under these conditions, sporulation is repressed, due to the stimulation of 
rapA and rapE expression via ComA~P. This results in high cellular levels of RapA and RapE 
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that together dephosphorylate SpoOF (Perego et a/., [1996] sypra; and Jiang ef a/., J. 
Bacterid., 182: 303-310 [2000]). During the post-exponential growth phase, when nutrients 
become limiting, RapA and RapE are antagonised by PhrA and PhrE, respectively, and 
sporulation is initiated via SpoOF~P. Although the production of degradive enzymes is known 
to be increased during high cell-density growth via the DegS-DegU two-component system, no 
involvement of chromosomally-encoded Rap phosphatases in the repression of protein 
secretion at low cell-densities has previously been documented. Recently, the pTA1060- 
encoded phosphatase Rap60 was shown to repress the secretion of proteolytic enzymes in S. 
subtilis 168 (Koetje et a/., Microbiology, 149: 19-28 [2003]). Expression of aprE, encoding the 
major extracellular proteolytic enzyme subtilisin, was strongly repressed in cells containing 
Rap60. However, co-production of Phr60, or the addition of synthetic Phr60 peptide, restored 
aprE transcription under these conditions (Koetje ef a/., supra). The regulatory cascade 
affected by Rap60 comprises, most likely, components of the phosphorelay (Hoch, J. Cell 
Biochem., 51:55-61 [1993]) and the transition-state regulator AbrB (Strauch and Hoch, Mol. 
Microbiol., 7:337-342 [1993]). Thus, plasmids endogenous to B. subtilis contain Rap-Phr 
systems that have a function in the cell-density controlled production of, at least one, 
expressed protease. However, the role of the chromosomal Rap-Phr system in protein 
expression remains unknown. 

SUMMARY OF THE INVENTION 

The present invention relates to cells that have been genetically manipulated to have 
an altered capacity to produce expressed proteins. In particular, the present invention 
relates to Gram-positive microorganisms having exogenous nucleic acid sequences 
introduced therein and methods for producing proteins in such host cells, such as members 
of the genus Bacillus. More specifically, the present invention relates to the expression, 
production and secretion of a polypeptide of interest and to cells that have been genetically 
manipulated to have an altered capacity to produce expressed proteins. In particular, the 
present invention provides for the enhanced expression of a selected polypeptide by a 
microorganism. 

In some embodiments, the present invention provides means for increasing the 
transcription, production and/or secretion of a protein in a cell comprising proteins involved 
in the Rap-Phr systems. 
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ln one embodiment, the present invention provides means for the hyperexpression 
(i.e., overexpression) of one or more of the phr genes, except phrC, which leads to 
enhanced expression of a polypeptide of interest. 

In a further embodiment of the invention, enhanced expression of a polypeptide of 
interest from a host cell is induced by contacting the host cell with a composition comprising 
one or more Phr proteins or pentapeptides. The Phr proteins or pentapeptides derived 
therefrom are selected from the group consisting of PhrA, PhrE, PhrF, Phrl, and PhrK. 

In additional embodiments, the present invention provides means for the inactivation 
of one or more of the rap genes or gene products, except rapC t which leads to enhanced 
expression of a polypeptide of interest. In some embodiments, the gene is inactivated by 
homologous recombination, while in other embodiments, inactivation is accomplished via in 
vivo mutagenesis, in vitro mutagenesis, or any other method known in the art. In some 
preferred embodiments a rap gene product is mRNA, while in other embodiments, it is the 
protein product. In some embodiments, the mRNA Is inactivated by contacting with an 
antisense RNA, an RNAse or an RNA binding protein that blocks translation. In some 
embodiments, a Rap protein is non-functional due to a modification in the rap gene, the 
presence of an antagonist and/or a blocker molecule. Any method that prevents the 
functional expression of one or more of the rap genes is contemplated herein. 

In one embodiment, the polypeptide of interest is a homologous/endogenous 
polypeptide the transcription/expression of which is controlled, directly or indirectly, by the 
spoA gene product. 

In another embodiment, the polypeptide of interest is a heterologous/exogenous 
polypeptide. 

Further provided herein are host cells wherein one or more of the rap genes, except 
rapC, have a decreased level of transcription and/or translation of the rap gene. 

DESCRIPTION OF THE FIGURES 

Figure 1 provides a bar graph depicting the different effects of Phr pentapeptides on 
aprE transcription. The transcription of the aprE-lacZ gene fusion in fi. subtllis AL grown in 
TY medium at 37°C in the presence of 10 pM of one of the following pentapeptides (see 
Table II): PhrA (A), PhrC, (C), PhrE (E), PhrF (F), PhrG (G), Phrl (I) or PhrK(K), or in the 
absence of pentapeptides (-), was measured 4 hours after the transition point between the 
exponential and post-exponential growth phases, p-galactosidase activities were 
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determined in Units per ODeoo (% of WT; aprE-lacZ transcription level of the strain grown in 
the absence of Phr peptides [WT] is set to 100%). 

Figure 2 provides a graph showing that PhrE stimulates aprE transcription. Time 
courses of the transcription of the aprE-lacZ gene fusion in B. subtilis AL grown in TY 
medium with at 37°C, without PhrE peptide (O; WT), and with 1 pM PhrE (► ), or 10 
PhrE (■) peptide added to the medium were determined, p-galactosidase activities were 
determined in Units per OD 6 oo^ Zero time (T 0 ) indicates the transition point between the 
exponential and post-exponential growth phases. 

Figure 3 shows that RapE is a repressor of aprE transcription. Time courses of the 
transcription of the aprE-lacZ gene fusion in 8. subtilis AL (O; WT), or 8. subtilis ArapE AL 
(■; ArapE) grown in TY medium at 37°C were determined, p-galactosidase activities were 
determined in Units per OD 6 oo. Zero time (T 0 ) indicates the transition point between the 
exponential and post-exponential growth phases. 

Figure 4 shows that there is improved protease secretion in a rapE mutant strain. 
Panel A shows results of monitoring extracellular protease production on plates. In these 
experiments, 8. subtilis 168 degU32(Hy) (WT) and 8. subtilis 168 ArapE degU32(Hy) 
(ArapE) were grown overnight at 37°C on TY-agar plates containing 1 % skim milk. The size 
of the halo's around the colonies reflects the level of proteolytic activity secreted into the 
medium. Panel B shows results of monitoring extracellular protease activity in shake flask 
cultures. In these experiments, 8. subtilis 168 degU32(Hy) (wild-type), and B. subtilis 168 
ArapE degU32(Hy) (ArapE) were grown overnight at 37°C in TY-medium. Next, proteolytic 
activity in the medium was determined using azo dye impregnated collagen (see: 
experimental procedures section; values are the average of three individual experiments). 

Figure 5, Panel A shows a schematic representation of the chromosome of 8. 
subtilis XphrE and XphrF, in which the phrE or phrF genes are integrated in the amyE gene 
and are under control of the xylose-inducible Pxyl promoter. 

Figure 5, Panel B and Panel C show that over-production of PhrE and PhrF can 
increase aprE transcription. Time courses of the transcription of the aprE-lacZ gene fusion 
in 8. subtilis AL (•; WT), 8. subtilis XphrE AL (□; PhrE) or 8. subtilis XphrFM (■; PhrF) 
and in, (B) 8. subtilis AL degU32(Hy) (•; WT), 8. subtilis XphrE AL degU32(Hy) (□ ; PhrE), 
or 8. subtilis XphrF AL degU32(Hy) (■; PhrF), grown in TY medium at 37°C were 
determined, p-galactosidase activities were determined in Units per ODeoo. Zero time (T 0 ) 
indicates the transition point between the exponential and post-exponential growth phases. 
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Figure 6, Panels A, B and C provide information regarding rapA. Panel A shows a 
schematic representation of the chromosome of B. subtilis ArapA, in which the rapA gene is 
disrupted with a chloramphenicol marker. Figure 6, Panel B and Panel C show that RapA is 
a repressor of aprE transcription. Time courses of the transcription of the aprE-lacZ gene 
fusion in B. subtilis AL (•; WT), or B. subtilis ArapA AL (■; ArapA), and B. subtilis AL 
degU32(Hy) (•; WT), or B. subtilis ArapA AL degU32(Hy) (■; ArapA) (Panel B) grown in TY 
medium at 37°C were determined. (3-galactosidase activities were determined in Units per 
OD 6 oo. Zero time (T 0 ) indicates the transition point between the exponential and post- 
exponential growth phases. 

Figure 7, Panel A shows a schematic representation of the chromosome of B. 
subtilis ArapF, in which the rapFgene is disrupted with a spectinomycin marker. Figure 7, 
Panel B and Panel C show that RapF is a repressor of aprE transcription. Time courses of 
the transcription of the aprE-lacZ gene fusion in B. subtilis AL (•; WT), or B. subtilis ArapF 
AL (■; ArapF) and in Panel B, B. subtilis AL degU32(Hy) (•; WT), or B. subtilis ArapF AL 
degU32(Hy) (■; ArapF) grown in TY medium at 37°C were determined, p-galactosidase 
activities were determined in Units per ODeoo- Zero time (T 0 ) indicates the transition point 
between the exponential and post-exponential growth phases. 

Figure 8 depicts a theoretical model for Rap-Phr-mediated regulation of AprE 
transcription. RapE is able to dephosphorylate SpoOF~P, components of the phosphorelay 
(SpoOF-SpoOB-SpoOA), thereby lowering the cellular levels of SpoOA~P. This results in 
low levels of aprE transcription, possibly via AbrB (Koetje et a/., supra). At high cell- 
densities, sufficient PhrE has been accumulated in the medium to be taken up in sufficient 
amounts by the Opp system. This results in RapE inhibition, more phosphorylated SpoOA 
and, eventually, increased aprE transcription. Notably, the addition of extracellular PhrC 
inhibits aprE transcription, possibly due to higher cellular levels of ComA-P causing a 
higher transcription level of rapE. Thus, Rap-Phr systems function as a break on the 
transcription of the aprE gene during the early growth phases (exponential growth) of the 
cell. After entry into the post-exponential growth phase this brake is, at least partly, 
removed by the accumulation of certain Phr peptides, and aprE transcription is further 
stimulated by DegU~P (accelerator). 

Figure 9 shows representative structures of transforming DNA, as described herein. 
In the top panel, homology boxes are shown flanking an incoming sequence. In the middle 
panel, other non-homologous sequences are shown to be added to the ends. In the bottom 
panel, the ends are closed such that the transforming DNA forms a closed circle or loop. 
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The bottom representation also encompasses the transforming DNA construct incorporated 
into a vector. 

Figure 10 provides a schematic diagram showing cloning by In vitro assembly and 
transformation of a competent Bacillus, in accordance with the present invention. For 
example, in some embodiments sequence 1 ("seq 1") is a phr gene, and sequence 2 ("seq 
2") is a gene encoding a polypeptide of interest. Although the kanamycin (kan) gene is 
depicted as the selection marker in this Figure, it is contemplated that any suitable selection 
maker finds use in the present invention. 

Figure 1 1 is a schematic diagram illustrating the insertion of a mutagenized incoming 
sequence into the host cell chromosome by homologous recombination. It should be noted 
that maximizing the homology between the transforming DNA and the target region of the 
chromosome can increase the transformation efficiency. In some preferred embodiments, 
the mutagenized sequence encodes a non-functional rap gene, one or more copies of the 
phr gene, a polypeptide of interest DNA sequence, etc. 

Figure 12 shows the DNA sequence for rapA (SEQ ID NO:1). 

Figure 13 shows the deduced amino acid for RapA (SEQ ID NO:2). 

Figure 14 shows the DNA sequence for phrA (SEQ ID NO:3). 

Figure 15 shows the deduced amino acid for PhrA (SEQ ID NO:4). 

Figure 16 shows the DNA sequence for rapB (SEQ ID NO:5). 

Figure 17 shows the deduced amino acid for RapB (SEQ ID NO:6). 

Figure 1 8 shows the DNA sequence for rapC (SEQ ID NO:7). 

Figure 19 shows the deduced amino acid for RapC (SEQ ID NO:8). 

Figure 20 shows the DNA sequence ioxphrC (SEQ ID NO:9). 

Figure 21 shows the deduced amino acid for PhrC (SEQ ID NO: 10). 

Figure 22 shows the DNA sequence for rapE (SEQ ID NO:11). 

Figure 23 shows the deduced amino acid for RapE (SEQ ID NO: 12). 

Figure 24 shows the DNA sequence for phrE (SEQ ID NO: 13). 

Figure 25 shows the deduced amino acid for PhrE (SEQ ID NO:14). 

Figure 26 shows the DNA sequence for rapF (SEQ ID NO:1 5). 

Figure 27 shows the deduced amino acid for RapF (SEQ ID NO:16). 

Figure 28 shows the DNA sequence for phrF (SEQ ID NO:17). 

Figure 29 shows the deduced amino acid for PhrF (SEQ ID NO:18). 

Figure 30 shows the DNA sequence for rapG (SEQ ID NO:39). 

Figure 31 shows the deduced amino acid for RapG (SEQ ID NO:40). 
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Figure 32 shows the DNA sequence for phrG (SEQ ID NO:41 ). 
Figure 33 shows the deduced amino acid for PhrG (SEQ ID NO:42). 
Figure 34 shows the DNA sequence for rapl (SEQ ID NO:1 9). 
Figure 35 shows the deduced amino acid for Rapl (SEQ ID NO:20). 
Figure 36 shows the DNA sequence for phrl (SEQ ID NO:21 ). 
Figure 37 shows the deduced amino acid for Phrl (SEQ ID NO:22). 
Figure 38 shows the DNA sequence for rapK (SEQ ID NO:23). 
Figure 39 shows the deduced amino acid for RapK (SEQ ID NO:24). 
Figure 40 shows the DNA sequence for phrK (SEQ ID NO:25). 
Figure 41 shows the deduced amino acid for PhrK (SEQ ID NO:26). 

DESCRIPTION OF THE INVENTION 

The present invention relates to cells that have been genetically manipulated to have 
an altered capacity to produce expressed proteins. In particular, the present invention 
relates to Gram-positive microorganisms having exogenous nucleic acid sequences 
introduced therein and methods for producing proteins in such host cells, such as members 
of the genus Bacillus. More specifically, the present invention relates to the expression, 
production and secretion of a polypeptide of interest and to cells that have been genetically 
manipulated to have an altered capacity to produce expressed proteins. In particular, the 
present invention provides for the enhanced expression of a selected polypeptide by a 
microorganism. 

All patents and publications, including all sequences disclosed within such patents 
and publications, referred to herein are expressly incorporated by reference. Unless defined 
otherwise herein, all technical and scientific terms used herein have the same meaning as 
commonly understood by one of ordinary skill in the art to which this invention belongs (See 
e.g., Singleton, et aL, DICTIONARY OF MICROBIOLOGY AND MOLECULAR BIOLOGY, 2D Ed., John 
Wiley and Sons, New York [1994]; and Hale & Marham, The Harper Collins Dictionary 
OF BIOLOGY, Harper Perennial, NY [1991], both of which provide one of skill with a general 
dictionary of many of the terms used herein). Although any methods and materials similar or 
equivalent to those described herein can be used in the practice or testing of the present 
invention, the preferred methods and materials are described. Numeric ranges are inclusive 
of the numbers defining the range. Unless otherwise indicated, nucleic acids are written left 
to right in 5* to 3' orientation; amino acid sequences are written left to right in amino to 
carboxy orientation, respectively. The headings provided herein are not limitations of the 
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various aspects or embodiments of the invention that can be had by reference to the 
specification as a whole. Accordingly, the terms defined immediately below are more fully 
defined by reference to the specification as a whole. 

Definitions 

As used herein, "host cell" refers to a cell that has the capacity to act as a host and 
expression vehicle for an expression cassette according to the invention. In one 
embodiment, the host cell is a Gram positive microorganism. In some preferred 
embodiments, the term refers to cells in the genus Bacillus. As used herein, the genus 
Bacillus includes all members known to those of skill in the art, including but not limited to 8. 
subtilis, B. licheniformis, B. lentus, B. brevis, B. stearothermophilus, B. alkalophilus, B. 
amyloliquefaciens, B. clausii, B. halodurans, B. megaterum, B. coagulans, B. ciculans, B. 
lautus, and B. thuringlensis. 

The term "polypeptide," as used herein, refers to a compound made up of amino acid 
residues linked by peptide bonds. The term "protein" as used herein, may be synonymous 
with the term "polypeptide" or may refer, in addition, to a complex of two or more polypeptides. 
Thus, the terms "protein," "peptide," and "polypeptide" are used interchangeably. 

Additionally, a "protein of interest," or "polypeptide of interest," refers to the 
protein/polypeptide to be expressed and secreted by the host cell. The protein of interest 
may be any protein that up until now has been considered for expression in prokaryotes 
and/or eukaryotes. In one embodiment, the protein of interest which is translocated by the 
secretion-associated proteins or systems utilized by the host cell include proteins comprising 
a signal peptide. The protein of interest may be either homologous or heterologous to the 
host. In some embodiments, the protein of interest is a secreted polypeptide, particularly an 
enzyme which is selected from amylolytic enzymes, proteolytic enzymes, cellulytic enzymes, 
oxidoreductase enzymes and plant wall degrading enzymes. In further embodiments, these 
enzyme include amylases, proteases, xylanases, lipases, laccases, phenol oxidases, 
oxidases, cutinases, cellulases, hemicellulases, esterases, peroxidases, catalases, glucose 
oxidases, phytases, pectinases, glucosidases, isomerases, transferases, galactosidases . . 
and chitinases. In still further embodiments, the expressed polypeptide is a hormone, 
growth factor, receptor, vaccine, antibody, or the like. While it is not intended that the 
present invention be limited to any particular protein/polypeptide, in some most preferred 
embodiments, the expressed polypeptide is a protease. 

As used herein, the terms "chimeric polypeptide" and "fusion polypeptide" are used 
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interchangeably to refer to a protein that comprises at least two separate and distinct 
regions that may or may not originate from the same protein. For example, a signal peptide 
linked to the protein of interest wherein the signal peptide is not normally associated with the 
protein of interest would be termed a chimeric polypeptide or chimeric protein. 

As used herein, the terms "chimeric DNA construct" and "heterologous nucleic acid 
construct," refer to a gene (/.e., one that has been introduced into a host) that is comprised 
of parts of different genes, including regulatory elements. Thus, in some embodiments, a 
chimeric gene is an endogenous gene operably linked to a promoter that is not its native 
promoter. A chimeric gene construct for transformation of a host cell is typically composed 
of a transcriptional regulatory region (promoter) operably linked to a protein coding 
sequence, or, in a selectable marker chimeric gene, to a selectable marker gene encoding a 
protein conferring antimicrobial resistance to transformed cells. In one embodiment, a 
typical chimeric gene construct of the present invention useful for transformation into a host 
cell comprises a transcriptional regulatory region that is constitutive or inducible, a signal 
peptide coding sequence, a protein coding sequence, and a terminator sequence. In other 
embodiments, the chimeric gene comprises a promoter operably linked to a phr gene. In yet 
other embodiments, the chimeric gene comprises a promoter operably linked to a gene 
encoding a protein of interest. In still further embodiments, chimeric gene constructs also 
comprise a second DNA sequence encoding a signal peptide if secretion of the target 
protein is desired. 

A "signal peptide," as used herein, refers to an amino-terminal extension on a protein 
to be secreted. "Signal sequence" is used interchangeably herein. In most preferred 
embodiments, secreted proteins use an amino-terminal protein extension which plays a 
crucial role in the targeting to and translocation of precursor proteins across the membrane 
and which is proteolytically removed by a signal peptidase during or immediately following 
membrane transfer. 

As used herein, the term "enhanced" refers to improved production of proteins of 
interest. In preferred embodiments, the present invention provides enhanced (/.©., 
improved) production and secretion of a protein of interest. In these embodiments, the 
"enhanced" production is improved as compared to the normal levels of production by the 
host (e.g., wild-type cells). Thus, for heterologous proteins, basically any expression is 
enhanced, as the cells normally do not produce the protein. 

As used herein, the terms "isolated" and "purified" refer to a nucleic acid or amino 
acid that is removed from at least one component with which it is naturally associated. 
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As used herein, the term "heterologous protein" refers to a protein or polypeptide that 
does not naturally occur in a host cell. Examples of heterologous proteins include enzymes 
such as hydrolases including proteases, cellulases, amylases, other carbohydrases, and 
lipases; isomerases such as racemases, epimerases, tautomerases, or mutases; 
transferases, kinases and phosphatases, hormones, growth factors, cytokines, antibodies 
and the like. Thus, in some embodiments, heterologous proteins comprise therapeutically 
significant protein or peptides (e.g., growth factors, cytokines, ligands, receptors and 
inhibitors), as well as vaccines and antibodies. In alternate embodiments, the protein is a 
commercially important industrial protein or peptide. It is intended that the term encompass 
protein that are encoded by naturally occurring genes, mutated genes, and/or synthetic 
genes. 

As used herein, the terms "heterologous nucleic acid construct" and "heterologous 
nucleic acid sequence" refer to a portion of a genetic sequence that is not native to the cell 
in which it is expressed. "Heterologous," with respect to a control sequence refers to a 
control sequence {i.e., promoter or enhancer) that does not function in nature to regulate the 
same gene the expression of which it is currently regulating. Generally, heterologous 
nucleic acid sequences are not endogenous to the cell or part of the genome in which they 
are present, and have been added to the cell, by infection, transfection, microinjection, 
electroporation, or the like. In some embodiments, "heterologous nucleic acid constructs" 
contain a control sequence/DNA coding sequence combination that is the same as, or 
different from a control sequence/DNA coding sequence combination found in the native 
cell. 

As used herein, the term "homologous protein" refers to a protein or polypeptide 
native or naturally occurring in a host cell. The present invention encompasses host cells 
producing the homologous protein via recombinant DNA technology. The present invention 
further encompasses a host cells with one or more deletions or one or more interruptions of 
the nucleic acid encoding the naturally occurring homologous protein or proteins, such as, 
for example, a protease, and having nucleic acid encoding the homologous protein or 
proteins re-introduced in a recombinant form (/.©., in an expression cassette). In other 
embodiments, the host cell produces the homologous protein. 

As used herein, the term "nucleic acid molecule" includes RNA, DNA and cDNA 
molecules. It will be understood that, as a result of the degeneracy of the genetic code, a 
multitude of nucleotide sequences encoding a given protein may be produced. 

As used herein, the term "vector" refers to a nucleic acid construct designed for 
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transfer between different host cells. An "expression vector" refers to a vector that has the 
ability to incorporate and express heterologous DNA fragments in a foreign cell. Many 
prokaryotic and eukaryotic expression vectors are commercially available. Selection of 
appropriate expression vectors is within the knowledge of those having skill in the art. 

As used herein, the terms "expression cassette" and "expression vector" refer to a 
nucleic acid construct generated recombinantly or synthetically, with a series of specified 
nucleic acid elements that permit transcription of a particular nucleic acid in a target cell. 
The recombinant expression cassette can be incorporated into a plasmid, chromosome, 
mitochondrial DNA, plastid DNA, virus, or nucleic acid fragment. Typically, the recombinant 
expression cassette portion of an expression vector includes, among other sequences, a 
nucleic acid sequence to be transcribed and a promoter. 

As used herein, the term "plasmid" refers to a circular double-stranded (ds) DNA 
construct used as a cloning vector, and which forms an extrachromosomal self-replicating 
genetic element in many bacteria and some eukaryotes. 

As used herein, the term "selectable marker-encoding nucleotide sequence" refers to 
a nucleotide sequence which is capable of expression in the host cells and where 
expression of the selectable marker confers to cells containing the expressed gene the 
ability to grow in the presence of a corresponding selective agent or lack of an essential 
nutrient. 

As used herein, the term "selectable marker" refers to a gene capable of expression 
in host cell which allows for ease of selection of those hosts containing the vector. 
Examples of such selectable markers include but are not limited to antimicrobials, (e.g., 
kanamycin, erythromycin, actinomycin, chloramphenicol and tetracycline). Thus, the term 
"selectable marker" refers to genes that provide an indication that a host cell has taken up 
an incoming DNA of interest or some other reaction has occurred. Typically, selectable 
markers are genes that confer antimicrobial resistance or a metabolic advantage on the host 
cell to allow cells containing the exogenous DNA to be distinguished from cells that have not 
received any exogenous sequence during the transformation. A "residing selectable 
marker" is one that is located on the chromosome of the microorganism to be transformed, 
A residing selectable marker encodes a gene that is different from the selectable marker on 
the transforming DNA construct. 

As used herein, the term "promoter refers to a nucleic acid sequence that functions 
to direct transcription of a downstream gene. In preferred embodiments, the promoter is 
appropriate to the host cell in which the target gene is being expressed. The promoter, 
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together with other transcriptional and translational regulatory nucleic acid sequences (also 
termed "control sequences") is necessary to express a given gene. In general, the 
transcriptional and translational regulatory sequences include/ but are not limited to, 
promoter sequences, ribosomal binding sites, transcriptional start and stop sequences, 
translational start and stop sequences, and enhancer or activator sequences. 

A nucleic acid is "operably linked" when it is placed into a functional relationship with 
another nucleic acid sequence. For example, DNA encoding a secretory leader (/.e., a 
signal peptide), is operably linked to DNA for a polypeptide if it is expressed as a preprotein 
that participates in the secretion of the polypeptide; a promoter or enhancer is operably 
linked to a coding sequence if it affects the transcription of the sequence; or a ribosome 
binding site is operably linked to a coding sequence if it is positioned so as to facilitate 
translation. Generally, "operably linked" means that the DNA sequences being linked are 
contiguous, and, in the case of a secretory leader, contiguous and in reading phase; 
However, enhancers do not have to be contiguous. Linking is accomplished by ligation at 
convenient restriction sites. If such sites do not exist, the synthetic oligonucleotide adaptors 
or linkers are used in accordance with conventional practice. 

As used herein, the term "gene" means the segment of DNA involved in producing a 
polypeptide chain, that may or may not include regions preceding and following the coding 
region (e.g. 5* untranslated (5* UTR) or "leader" sequences and 3' UTR or "trailer" 
sequences), as well as intervening sequences (introns) between individual coding segments 
(exons). 

In some embodiments, the gene encodes therapeutically significant proteins or 
peptides, such as growth factors, cytokines, ligands, receptors and inhibitors, as well as 
vaccines and antibodies. The gene may encode commercially important industrial proteins 
or peptides, such as enzymes (e.g., proteases, carbohydrases such as amylases and 
glucoamylases, cellulases, oxidases and lipases). However, it is not intended that the 
present invention be limited to any particular enzyme or protein. In some embodiments, the 
gene of interest is a naturally-occurring gene, a mutated gene or a synthetic gerte. 

A nucleic acid sequence is considered to be "selectively hybridizable" to a reference 
nucleic acid sequence if the two sequences specifically hybridize to one another under 
moderate to high stringency hybridization and wash conditions. Hybridization conditions are 
based on the melting temperature I of the nucleic acid binding complex or probe. For 
example, "maximum stringency" typically occurs at about Tm-5°C (5° below the Tm of the 
probe); "high stringency" at about 5-1 0°C below the Tm; "intermediate stringency" at about 
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10-20°C below the Tm of the probe; and low stringency" at about 20-25°C below the Tm. 
Functionally, maximum stringency conditions may be used to identify sequences having 
strict identitytor near-strict identity with the hybridization probe; while high stringency 
conditions are used to identify sequences having about 80% or more sequence identity with 
the probe. 

Moderate and high stringency hybridization conditions are well known in the art. An 
example of high stringency conditions includes hybridization at about 42°C in 50% 
formamide, 5X SSC, 5X Denhardt's solution, 0.5% SDS and 100 ng/ml denatured carrier 
DNA followed by washing two times in 2X SSC and 0.5% SDS at room temperature and two 
additional times in 0.1X SSC and 0.5% SDS at 42°C. 

As used herein, "recombinant" includes reference to a cell or vector, that has been 
modified by the introduction of a heterologous nucleic acid sequence or that the cell is derived 
from a cell so modified. Thus, for example, recombinant cells express genes that are not found in 
identical form within the native (non-recombinant) form of the cell or express native genes that are 
otherwise abnormally expressed, under expressed or not expressed at all as a result of deliberate 
human intervention. "Recombination, "recombining ," or generating a "recombined" nucleic acid is 
generally the assembly of two or more nucleic acid fragments wherein the assembly gives rise to a 
chimeric gene. 

As used herein, the terms "transformed," "stably transformed," and "transgenic" used 
in reference to a cell means the cell has a non-native (heterologous) nucleic acid sequence 
integrated into its genome or as an episomal plasmid that is maintained through two or more 
generations. 

As used herein, the term "expression" refers to the process by which a polypeptide is 
produced based on the nucleic acid sequence of a gene. The process includes both 
transcription and translation. 

As used herein, the term "introduced" used in the context of inserting a nucleic acid 
sequence into a cell, means "transfection," "transformation," or "transduction," and includes 
reference to the incorporation of a nucleic acid sequence into a eukaryotic or prokaryotic cell 
where the nucleic acid sequence may be incorporated into the genome of the cell (e.g., 
chromosome, plasmid, plastid, or mitochondrial DNA), converted into an autonomous 
replicon, or transiently expressed (for example, transfected mRNA). 

As used herein, "transforming DNA," "transforming sequence," and "DNA construct" 
refer to DNA that is used to introduce sequences into a host cell or organism. Transforming 
DNA is DNA used to introduce sequences into a host cell or organism. The DNA may be 
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generated in vitro by PGR or any other suitable techniques. A typical structure of 
transforming DNA is shown in a schematic form in Figure 9. The transforming DNA 
comprises an incoming sequence. It may further comprise an incoming sequence flanked 
by homology boxes. In a further embodiment, the transforming DNA may comprise other 
non-homologous sequences, added to the ends (/.a, stuffer sequences or flanks). The 
ends can be closed such that the transforming DNA forms a closed circle, such as, for 
example, insertion into a vector. 

In a preferred embodiment, mutant DNA sequences are generated with site 
saturation mutagenesis in at least one codon. In another preferred embodiment, site 
saturation mutagenesis is performed for two or more codons. In a further embodiment, 
mutant DNA sequences have more than 40%, more than 45%, more than 50%, more than 
55%, more than 60%, more than 65%, more than 70%, more than 75%, more than 80%, 
more than 85%, more than 90%, more than 95%, or more than 98% homology with the wild- 
type sequence. In alternative embodiments, mutant DNA is generated in vivo using any 
known mutagenic procedure such as, for example, radiation, nitrosoguanidine and the like. 
The desired DNA sequence is then isolated and used in the methods provided herein. 

In an alternative embodiment, the transforming DNA sequence comprises homology 
boxes without the presence of an incoming sequence. In this embodiment, it is desired to 
delete the endogenous DNA sequence between the two homology boxes. Furthermore, in 
some embodiments, the transforming sequences are wild-type, while in other embodiments, 
they are mutant or modified sequences. In addition, in some embodiments, the transforming 
sequences are homologous, while in other embodiments, they are heterologous. 

As used herein, the term "incoming sequence" refers to a DNA sequence that is 
introduced into the Bacillus chromosome or genome. In preferred embodiments, the 
incoming sequence encodes one or more proteins of interest. In some embodiments, the 
incoming sequence comprises a sequence that may or may not already be present in the 
genome of the cell to be transformed (/.e., it may be either a homologous or heterologous 
sequence). In some embodiments, the incoming sequence encodes one or more proteins 
of interest, a gene, and/or a mutated or modified gene. In some embodiments, the 
incoming sequence includes a selectable marker, such as a gene that confers resistance to 
an antimicrobial. 

In one embodiment, the incoming sequence encodes at least one heterologous 
protein including, but not limited to hormones, enzymes, and growth factors. In another 
embodiment, the enzyme includes, but is not limited to hydrolases, such as protease, 
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esterase, lipase, phenol oxidase, permease, amylase, pullulanase, cellulase, glucose 
isomerase, laccase and protein disulfide isomerase. 

In alternative embodiments, the incoming sequence encodes a functional wild-type 
gene or operon, a functional mutant gene or operon, or a non-functional gene or operon. In 
some embodiments, the non-functional sequence may be inserted into a gene to disrupt 
function of the gene. 

As used herein, the term "target sequence" refers to a DNA sequence in the host cell 
that encodes the sequence where it is desired for the incoming sequence to be inserted into 
the host cell genome. In some embodiments, the target sequence encodes a functional 
wild-type gene or operon, while in other embodiments the target sequence encodes a 
functional mutant gene or operon, or a non-functional gene or operon. 

As used herein, a "flanking sequence" refers to any sequence that is either upstream 
or downstream of the sequence being discussed (e.g., for genes ABC, gene B is flanked 
by the A and C gene sequences). In a preferred embodiment, the incoming sequence is 
flanked by a homology box on each side. In another embodiment, the incoming sequence 
and the homology boxes comprise a unit that is flanked by stuffer sequence on each side. 
In some embodiments, a flanking sequence is present on only a single side (either 3' or 5'), 
but in preferred embodiments, it is on each side of the sequence being flanked. The 
sequence of each homology box is homologous to a sequence in the Bacillus chromosome. 
These sequences direct where in the Bacillus chromosome the new construct gets 
integrated and what part of the Bacillus chromosome will be replaced by the incoming 
sequence. 

As used herein, the term "stuffer sequence" refers to any extra DNA that flanks 
homology boxes (typically vector sequences). However, the term encompasses any non- 
homologous DNA sequence. Not to be limited by any theory, a stuffer sequence provides a 
noncritical target for a cell to initiate DNA uptake. 

As used herein, the term "chromosomal integration" refers to the process whereby 
the incoming sequence is introduced into the chromosome of a "host cell (e.g., Bacillus). 
The homology boxes of the transforming DNA align with homologous regions of the 
chromosome. Subsequently, the sequence between the homology boxes is replaced by the 
incoming sequence in a double crossover (/.e., homologous recombination). 

As used herein, the term "homologous recombination" refers to the exchange of DNA 
fragments between two DNA molecules or paired chromosomes (e.g., during crossing over) 



WO 03/070963 



PCT/US03/02961 



-17- 

at the site of identical nucleotide sequences. In a preferred embodiment, chromosomal 
integration is by homologous recombination. 

As used herein, the term "library of mutants" refers to a population of cells which are 
identical in most of their genome but include different homologues of one or more genes. 
Such libraries can be used, for example, to identify genes or operons with improved traits. 

As used herein, the terms "hypercompetenf and "super competent" mean that 
greater than 1% of a cell population is transformable with chromosomal DNA (e.g., Bacillus 
DNA). Alternatively, the terms are used in reference to cell populations in which greater 
than10% of a cell population is transformable with a self-replicating plasmid (e.g., a Bacillus 
plasmid). Preferably, the super competent cells are transformed at a rate greater than 
observed for the wild-type or parental cell population. Super competent and 
hypercompetent are used interchangeably herein. 

The present invention provides means for the expression, production and secretion of 
polypeptides of interest, as well as cells that have been genetically manipulated to have an 
altered capacity to produce expressed proteins. In particular, the involvement of the Rap-Phr 
system in protein expression and secretion is described herein. As indicated, in preferred 
embodiments of the present invention, enhancement in expression and/or secretion of a 
polypeptide of interest is mediated by chromosomally-encoded Rap-Phr systems. In one 
embodiment, the polypeptide of interest is a secretory protease. In a specific embodiment the 
RapE-PhrE system encoded by the S. subtilis 168 chromosome is involved in the control of 
transcription of the aprE gene. In other embodiments, the extracellular addition of a synthetic 
Phr pentapeptide increases protein expression. In yet another embodiment, the disruption of 
a rap gene mediates a significant increase in protein expression. In one preferred 
embodiment, the protein is AprE. In a further embodiment, the expression and/or secretion of 
proteolytic enzymes is significantly increased in a strain carrying a degU32(Hy) mutation, in 
which the rapE gene is disrupted. 

Many industrially important products (e.g., enzymes, hormones, growth factors, and 
other proteins) are produced from members of the genus Bacillus in large scale fermentation 
processes, including without limitation, proteases, lipases, amylases, and beta-glucanases. 
These products (/.e., proteins of interest) are either homologous or heterologous to the host. 
For homologous proteins, "overexpression" refers to expression above normal levels in the 
host cell. For heterologous proteins, any expression is of course "overexpression. " Thus, it 
is advantageous to have a cell that will fail to sporulate yet possesses enhanced expression 
of gene(s) of interest. 
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Bacillus subtilis uses two-component signal transduction systems to sense intra-, 
and extracellular stimuli to adapt to environmental changes. Regulator aspartate 
phosphatases (Rap) have important roles in these processes, as they can dephosphorylate 
certain response-regulators, and are themselves subject to cell-density controlled inhibition 
by secreted Phr peptides. In B. subtilis, this family of Rap phosphatases is encoded by 
eleven chromosomal genes. During the development of the present invention it was 
determined that the addition of certain synthetic Phr pentapeptides, among which PhrE, to 
the growth medium of B. subtilis 168 improved the transcription of the aprE gene, encoding 
a major extracellular protease. Similarly, disruption of the 6. subtilis 168 rapE gene 
improved the transcription of the aprE gene. Furthermore, the expression and/or secretion 
of proteolytic enzymes was shown to be significantly increased in a strain in which the rapE 
mutation was combined with a degU32(Hy) mutation for hyper secretion. Thus, Rap-Phr 
quorum-sensing mechanisms enable S. subtilis to suppress protease secretion under 
conditions of low-cell densities, as according in their natural habitat. However, under (high- 
cell density) fermentation conditions, Rap phosphatases, such as RapE, might actually 
repress protease production. Thus, the modulation of cellular levels of certain Rap 
phosphatases finds use in improving the yields of Bacillus production strains. However, it is 
not intended that the present invention be limited to any particular mechanism(s). 

Data obtained during the development of the present invention show that at least 
some chromosomally-encoded Rap-Phr systems are involved in the regulation of certain 
genes and protein expression and/or secretion. Although the experiments were focused on 
the RapE-PhrE system, it is contemplated that other chromosomally-encoded Rap-Phr 
systems are also involved in cell-density controlled protein expression and/or secretion. 
Thus, it is not intended that the present invention be limited to any particular mechanisms. 
However, this hypothesis is corroborated by some lines of evidence. First, as RapE is 
known to be involved in the cell-density controlled dephosphorylation of SpoOF-P (Jiang et 
aL, J. Bacteriol., 182:303-310 [2000]), other Rap phosphatases that are known to be acting 
on SpoOF~P, such as RapA and RapB (Perego, [1998], supra; and Tzeng et aL, Biochem., 
37:16538-16545 [1998]), are likely candidates to be involved in aprE transcription. Second, 
experiments concerning the extracellular addition of synthetic Phr peptides indicated that at 
least one other Rap phosphatase of unknown function, RapF, has an effect on aprE 
transcription similar to that of RapE (See, Figure 1). Third, experiments to assess the 
extracellular addition of Phr peptides indicated that RapC antagonizes the function of RapE. 
As RapC is known to be involved in the transcription of the rapA and rapE genes, via the 
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dephosphorylation of ComA~P (Jiang et a/., [2000], supra), this is most likely an indirect 
effect of higher cellular levels of RapA and RapE. Figure 8 provides a putative model for 
Rap-Phr-mediated aprE transcription. However, as indicated throughout the present 
Specification, it is not intended that the present invention be limited to any particular 
mechanism(s). RapE is able to dephosphorylate SpoOF~P, a component of the 
phosphorelay (SpoOF-SpoOB-SpoOA), thereby lowering the cellular levels of SpoOA~P. At 
high cell-densities, sufficient PhrE has been accumulated in the medium to be taken up in 
sufficient amounts. This results in RapE inhibition, more phosphorylated SpoOA and, 
eventually, increased aprE transcription. Notably, extracellularly added PhrC inhibits aprE 
transcription, possibly due to higher cellular levels of ComA~P causing a higher transcription 
level of rapE. Thus, Rap-Phr systems function as a brake on the transcription of the aprE 
gene during the early growth phases (exponential growth) of the cell. After entry into the 
post-exponential growth phase this brake is, at least partly, removed by the accumulation of 
certain Phr peptides, and aprE transcription is further stimulated by DegU~P. 

Together, these results indicate that Rap-Phr systems of S. subtilis are suitable for 
exploitation in cell-density controlled production of industrial proteins. The potential power of 
these systems is that they function as a brake on the transcription of genes, such as aprE, 
during the early growth phases (exponential growth) of the cell. By modulating the cellular 
levels of Rap/Phr, this brake time can, conceivably, be shortened or extended. The latter is 
desirable for the production of deleterious proteins which inhibit the growth of 8. subtilis. By 
delaying the initiation of transcription of the genes for such proteins until the (late) post- 
exponential growth phase, the growth inhibitory effect is be of minor importance. In contrast, 
the removal of certain Rap-Phr systems, such as RapE-PhrE, removes part of the brake 
during the exponential growth phases, which is contemplated to have an positive effect on 
bulk enzyme production. Notably, the present results indicate that overproduction of RapC 
might have a similar effect on protease secretion as the depletion of RapE. 

B. subtilis rap and phr Nucleotide Sequences 

As indicated above, sequences for various Rap and Phr proteins and the genes that 
encode them are shown in Figures 12 through 37. The Sequence Identification Numbers 
are provided in the Description of the Figures. 
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Proteins of Interest 

The present invention is particularly useful in enhancing the intracellular and/or 
extracellular production of proteins. In some embodiments, the protein is homologous, while 
in other embodiments, it is heterologous. The present invention finds use in the production 
of various proteins, including but not limited to hormones, enzymes, growth factors, 
cytokines, antibodies, and the like. The present invention finds particular use in the 
production of enzymes, including but not limited to hydrolases, such as proteases, 
esterases, lipases, phenol oxidase, permeases, amylases, pullulanases, cellulases, glucose 
isomerase, laccases and protein disulfide isomerases. However, the present invention also 
finds use in the production of hormones, including but are not limited to, follicle-stimulating 
hormone, luteinizing hormone, corticotropin-releasing factor, somatostatin, gonadotropin 
hormone, vasopressin, oxytocin, erythropoietin, insulin and the like. 

Growth factors are proteins that bind to receptors on the cell surface, with the 
primary result of activating cellular proliferation and/or differentiation. In addition to enzymes 
and hormones, the present invention finds use in the production of growth factors including 
but are not limited to, platelet-derived growth factor, epidermal growth factor, nerve growth 
factor, fibroblast growth factors, insulin-like growth factors, transforming growth factors, etc. 
Cytokines are a unique family of growth factors. Secreted primarily from leukocytes, 
cytokines stimulate both the humoral and cellular immune responses, as well as the 
activation of phagocytic cells. Cytokines include, but are not limited to, colony stimulating 
factors, the interleukins (e.g., IL-1 [a and P], IL-2 through IL-13) and the interferons (a, 0 and 
Y). Human lnterleukin-3 (IL-3) is a 15 kDa protein containing 133 amino acid residues. IL-3 
is a species-specific colony stimulating factor which stimulates colony formation of 
megakaryocytes, neutrophils, and macrophages from bone marrow cultures. 

In addition to the above proteins, it is contemplated that the present invention will find 
use in embodiments involving antibodies. Antibodies include, but are not limited to, 
immunoglobulins from any species from which it is desirable to produce large quantities. It 
is especially preferred that the antibodies are human antibodies. Immunoglobulins may be 
from any class (/.e., IgG, IgA, IgM, IgE, or IgD). 

In some embodiments of the present invention, the protein of interest is fused to a 
signal peptide. Signal peptides from two secretory pathways are specifically contemplated 
by the present invention. The first pathway is the sec-dependent pathway. This pathway is 
well characterized and a number of putative signal sequences have been described. It is 
intended that all sec-dependent signal peptides be encompassed by the present invention. 
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Specific examples include, but are not limited to the AmyL and the AprE sequences. The 
AmyL sequence refers to the signal sequence for a-amylase and AprE refers to the AprE 
signal peptide sequence (AprE is subtilisin [also referred to as alkaline protease] of B. 
subtilis). Any signal sequence derived from any source may be used as long as it is 
functional (/.e., directs the protein of interest into the secretory pathway), in the host cell. 

Host Cells 

The present invention provides host microorganisms and expression means for the 
expression, production and secretion of desired proteins in Gram-positive microorganisms, 
such as members of the genus Bacillus. 

In a general embodiment, the present methods find use in enhancing the expression 
and/or secretion of any protein of interest produced intracellular^ or secreted via the Sec- 
dependent secretion pathway. Thus, any protein of interest that may be fused to a Sec- 
dependent signal peptide by recombinant DNA methods finds use in the present invention. 

In preferred embodiments, the host cell is rendered capable of enhanced expression 
and/or secretion of a protein of interest by transforming the host cell with nucleotide 
sequences encoding at least one protein in the Rap-Phr system or modified sequence(s) 
thereof. In some embodiments, the protein of interest is endogenous, while in other 
embodiments, it is heterologous. In some embodiments, the protein of interest is a chimeric 
protein in which a native protein of interest is fused to a Sec-dependent signal sequence. In 
a preferred embodiment, the at least one Rap-Phr gene is operably linked to a promoter. In 
some embodiments, the promoter is constitutive, while in other embodiments, it is inducible. 
A preferred promoter is the B. subtilis aprE promoter. It is further contemplated that by 
varying the level of induction, of an inducible promoter it is possible to modulate the 
expression of the gene product and thereby modulate the secretion of the protein of interest. 

In some embodiments, the host cell is a Gram-positive cell. In preferred 
embodiments, the Gram-positive cell is a member of the genus Bacillus. As used herein, 
the genus Bacillus includes all members known to those of skill in the art, including but not 
limited to B. subtilis, B. licheniformis, B. lentus, B. brevls, B. steamthermophilus, B. 
amyloliquefaciens, B. halodurans, B. clausii, B. coagulans, B. cirvulans, B. megaterum, B. 
lautus, and B. thuringiensis. In some particularly preferred embodiments, the member of the 
genus Bacillus is selected from the group consisting of B. subtilis, B. clausii, and B. 
licheniformis. 
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rap Genes and Rap Proteins 

In some embodiments of the present invention, at least one rap gene or gene 

product is "non-functional," in that the gene or gene product is unable to exert its known 
function. 

In some preferred embodiments, the rap gene is modified. As used herein, the 
terms "modified rap sequence" and modified rap genes" are used interchangeably and refer 
to at least one deletion, insertion, interruption and/or mutation of the naturally occurring rap 
nucleic acid sequence. The expression product of the modified sequence may be a 
truncated protein if the modification is a deletion or interruption of the sequence or mutation 
altering its functionality. In some preferred embodiments, the modified Rap protein does not 
retain biological activity. In alternative embodiments, the expression product of the modified 
sequence is an elongated protein (e.g., in embodiments in which the modification Is an 
insertion into the nucleic acid sequence). An insertion may lead to a truncated protein as 
the expression product if the insertion results in the formation of a stop codon. Thus, as 
known in the art, depending upon the nature of the insertion, an insertion may result in either 
a truncated protein or an elongated protein as an expression product. 

In alternative embodiments, the rap gene product is rendered non-functional. In 
some embodiments, the rap gene product comprises mRNA, while in other embodiments, it 
comprises a protein product. In some embodiments the mRNA is inactivated by contacting it 
with an antisense RNA, an RNAse or an RNA binding protein that blocks translation. Thus, 
in some embodiments, Rap protein is non-functional due to a modification in the rap gene. 
However, in other embodiments, the non-functionality results from the presence of an 
antagonist and/or a blocker molecule. 

phr Genes and Phr Gene Product 

In some embodiments of the present invention, at least one phr gene or gene 
product is overexpressed. In some embodiments, the phr gene is overexpressed by 
increasing the copy number of the selected phr gene. In other embodiments, the phr gene 
is put under the control of a stronger promoter, while in still further embodiments, it has a 
stronger ribosome binding site. 

In alternative embodiments, the phr gene product is added to the growth medium of 
the host cells. In some embodiments, the phr gene products, Phr pentapeptides (shown in 
Table II) are added. In these embodiments, the Phr protein and/or Phr pentapeptides may 
be made synthetically or are isolated from a natural source. Thus, the exogenous phr gene 
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products are present in elevated amounts in the culture medium to enhance protein 
production and/or secretion. 

Transformation of Bacillus Host Cells 

In some embodiments of the present invention, nucleic acid encoding at least one 
polypeptide of interest is introduced into a host cell via an expression vector capable of 
replicating within the host cell. Suitable replicating plasmids for Bacillus known in the art 
(See e.g., Harwood and Cutting (eds), Molecular Biological Methods for Bacillus . John Wiley 
& Sons, [1990], in particular, chapter 3; suitable replicating plasmids for B. subtllis include 
those listed on page 92). Although there are technical hurdles, those of skill in the art know 
that there are several strategies for the direct cloning of DNA in Bacillus. 

Methods known in the art to transform Bacillus, include such methods as plasmid 
marker rescue transformation, involves the uptake of a donor plasmid by competent cells 
carrying a partially homologous resident plasmid (Contente et a/., Plasmid 2:555-571 [1979]; 
Haima ef a/., Mol. Gen. Genet., 223:185-191 [1990]; Weinrauch et a/., J. Bacteriol., 
154:1077-1087 [1983]; and Weinrauch ef a/., J. Bacteriol., 169:1205-1211 [1987]). In this 
method, the incoming donor plasmid recombines with the homologous region of the resident 
"helper" plasmid in a process that mimics chromosomal transformation. 

Another method involving transformation by protoplast transformation is known in the 
art (See, Chang and Cohen, Mol. Gen. Genet, 168:1 1 1-1 15 [1979], for fl. subtilis; 
Vorobjeva ef a/., FEMS Microbiol. Lett., 7:261-263 [1980], for B. megaterium; Smith et a/., 
Appl. Env. Microbiol. ,51:634 [1986], for 6. amyloliquefaciens; Fisher et a/., Arch. Microbiol., 
139:213-217 [1981], for B. thuringiensis; McDonald [1984] J. Gen. Microbiol., 130:203 
[1984], for B. sphaericus\ and Bakhiet et a/., 49:577 [1985] B. larvae). In addition, Mann et 
a/., (Mann et a/., Curr. Microbiol., 13:131-135 [1986]) describe transformation of Bacillus 
protoplasts, and Holubova (Holubova, Microbiol., 30:97 [1985]) describe methods for 
introducing DNA into protoplasts using DNA-containing liposomes. In some preferred 
embodiments, marker genes are used in order to indicate whether or not the gene of interest 
is present in the host cell. 

In addition to these methods, in other embodiments, host cells are directly 
transformed. In "direct transformation," an intermediate cell is not used to amplify, or 
otherwise process, the DNA construct prior to introduction into the host (I.e., Bacillus) cell. 
Introduction of the DNA construct into the host cell includes those physical and chemical 
methods known in the art to introduce DNA into a host cell without insertion into a plasmid or 
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vector. Such methods include but are not limited to calcium chloride precipitation, 
electroporation, naked DNA, liposomes and the like. In yet other embodiments, the DNA 
constructs are co-transformed with a plasmid without being inserted into the plasmid. 

Vectors/Plasmids 

For expression, production and/or secretion of protein(s) of interest in a cell, an 
expression vector comprising at least one copy of a nucleic acid encoding the heterologous 
and/or homologous protein(s), and preferably comprising multiple copies, is transformed into 
the cell under conditions suitable for expression of the protein(s). 

In preferred embodiments of the present invention, expression vectors used to 
express the Phr proteins in Gram-positive microorganisms comprise at least one promoter 
associated with a Gram-positive Phr protein, wherein the promoter is functional in the host 
cell. In one embodiment, the promoter is the wild-type promoter for the selected secretion 
factor, while in an alternative embodiment, the promoter is heterologous to the secretion 
factor, but still functional in the host cell. 

Additional promoters associated with heterologous nucleic acid encoding desired 
proteins or polypeptides also find use in the present invention and may be introduced via 
recombinant DNA techniques known in the art. In one embodiment of the present invention, 
the host cell is capable of overexpressing a heterologous protein or polypeptide and nucleic 
acid encoding one or more recombinants introduced secretion factors). In one preferred 
embodiment of the present invention, nucleic acid encoding a Phr protein or a mutagenized 
Rap protein is stably integrated into the microorganism genome. In another embodiment, 
the host cell is engineered to overexpress a Phr protein and nucleic acid encoding the 
heterologous protein or polypeptide is introduced via recombinant DNA techniques. The 
present invention further encompasses Gram-positive host cells that have one or more Rap 
proteins inactivated. 

In a preferred embodiment, the expression vector contains a multiple cloning site 
cassette which preferably comprises at least one restriction endonuclease site unique to the 
vector, to facilitate ease of nucleic acid manipulation. In a preferred embodiment, the vector 
also comprises one or more selectable markers. In yet another embodiment, a multicopy 
replicating plasmid is for integration of the plasmid into the Bacilllus genomic DNA using 
methods known in the art. 
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Identlfication of Transformants 

Although the presence/absence of marker gene expression suggests that the gene 
of interest is also present, its presence and expression should be confirmed. For example, if 
the nucleic acid encoding a Phr protein is inserted within a marker gene sequence, 
recombinant cells containing the insert can be identified by the absence of marker gene 
function. Alternatively, in other embodiments, a marker gene is placed in tandem with 
nucleic acid encoding the Phr protein under the control of a single promoter. Expression of 
the marker gene in response to induction or selection usually indicates expression of the Phr 
protein as well. 

Alternatively, host cells which contain the coding sequence for a Phr protein and/or 
express the protein of interest may be identified by any means known to those of skill in the 
art. These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridization 
and protein bioassay or immunoassay techniques, including, but not limited to membrane- 
based, solution-based, or chip-based technologies for the detection and/or quantification of 
the nucleic acid or protein. 

In other embodiments, the presence of the Phr protein polynucleotide sequence is 
detected by DNA-DNA or DNA-RNA hybridization or amplification using probes, portions or 
fragments derived from the polynucleotide encoding any one of the Phr proteins. 

The presence of an inactivated or non-functional Rap protein may be detected by 
methods known in the art. For example, the hyperexpression of RAPs can cause 
expression of a spo" phenotype, which can be detected utilizing methods known in the art. 
Alternatively, in vitro detection of the dephosphorylation of spoOF~P to spoOF may indicate 
the presence of the Rap protein. 

Measuring Gene Products 

There are various assays known to those of skill in the art for detecting and 
measuring activity of intracellular^ and extracellularly expressed polypeptides (e.g., the 
protein of interest and/or phr gene products) which find use with the present invention. In 
particular, for proteases, there are assays based upon the release of acid-soluble peptides 
from casein or hemoglobin measured as absorbance at 280 nm or colorimetrically using the 
Folin method (See e.g., Bergmeyer et a/., Methods of Enzymatic Analysis , vol. 5, 
[Peptidases, Proteinases and their Inhibitors], Verlag Chemie, Weinheim [1984]). Other 
assays involve the solubilization of chromogenic substrates (See, Ward, in Fogarty (ed), 
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Proteinases. in Microbial Enzymes and Biotechnology Applied Science, London, pp. 251- 
317 [1983]). 

Means for determining the levels of secretion of a heterologous or homologous 
protein in a host cell and detecting expressed proteins include, but are not limited to 
immunoassay methods that utilize either polyclonal or monoclonal antibodies specific for the 
protein. Examples include enzyme-linked immunosorbent assay (ELISA), 
radioimmunoassay (RIA), fluorescent immunoassays (FA), and fluorescent activated cell 
sorting (FACS), as well as Western blots, etc. These assay systems are well known in the 
art (See e.g., Maddox et a/., J. Exp. Med., 158:121 1 [1983]). In some particularly preferred 
embodiments of the present invention, secretion is greater using the methods and 
compositions provided herein than when using untransformed host cells or unmodified 
media. 

Protein Purification 

In preferred embodiments, the cells transformed with polynucleotide sequences 
encoding heterologous or homologous protein or endogenously having said protein are 
cultured under conditions suitable for the expression and recovery of the encoded protein 
from the cell culture medium. In some embodiments, other recombinant constructions may 
join the heterologous or homologous polynucleotide sequences to nucleotide sequence 
encoding a polypeptide domain which will facilitate purification of soluble protein (e.g., tags 
of various sorts) (Kroll et al., DNA Cell. Biol., 1 2:441 -53 [1 993]). 

Such purification facilitating domains include, but are not limited to, metal chelating 
peptides such as histidine-tryptophan modules that allow purification on immobilized metals 
(Porath, Prot. Expr. Purif., 3:263-281 [1992]), protein A domains that allow purification on 
immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity 
purification system (Immunex Corp, Seattle WA). The inclusion of a cleavable linker 
sequence such as Factor XA or enterokinase (Invitrogen, San Diego, CA) between the 
purification domain and the heterologous protein can be used to facilitate purification. 

From the above, it is clear that the present invention provides genetically engineered 
Gram-positive host microorganisms comprising preferably non-revertable mutations, 
including gene replacement, in at least one gene encoding a phr and/or rap gene. In some 
embodiments, compositions useful in enhancing protein production from a host cell are 
provided. In the preferred embodiments, the host cells is a member of the genus Bacillus. 
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In some preferred embodiments, the microorganism is also genetically engineered to 
produce a desired protein or polypeptide. In one such preferred embodiment, the Gram- 
positive microorganism is a member of the genus Bacillus. 

The present invention further provides the advantage that it finds use in shifting the 
proportion of the host cell population to increase the number of producing a protein of 
interest. 

EXPERIMENTAL 

The following examples are provided in order to demonstrate and further illustrate 
certain preferred embodiments and aspects of the present invention and are not to be 
construed as limiting the scope thereof. 

In the experimental disclosure which follows, the following abbreviations apply: °C 
(degrees Centigrade); rpm (revolutions per minute); H 2 0 (water); dH 2 0 (deionized water); 
(HCI (hydrochloric acid); aa (amino acid); bp (base pair); kb (kilobase pair); kD (kilodaltons); 
gm (grams); pg (micrograms); mg (milligrams); ng (nanograms); pi (microliters); ml 
(milliliters); mm (millimeters); nm (nanometers); pm (micrometer); M (molar); mM (millimolar); 
pM (micromolar); U (units); V (volts); MW (molecular weight); sec (seconds); min(s) 
(minute/minutes); hr(s) (hour/hours); MgCI 2 (magnesium chloride); NaCI (sodium chloride); 
OD 2 bo (optical density at 280 nm); ODeoo (optical density at 600 nm); PAGE (polyacrylamide 
gel electrophoresis); PBS (phosphate buffered saline [150 mM NaCI, 10 mM sodium 
phosphate buffer, pH 7.2]); PEG (polyethylene glycol); PGR (polymerase chain reaction); 
RT-PCR (reverse transcription PCR); SDS (sodium dodecyl sulfate); Tris 
(tris(hydroxymethyl)aminomethane); w/v (weight to volume); v/v (volume to volume); LA 
medium (per liter: Difco Tryptone Peptone 20g, Difco Yeast Extract 10g, EM Science NaCI 
1g, EM Science Agar 17.5g, dH20 to 1L); ATCC (American Type Culture Collection, 
Rockville, MD); Clontech (CLONTECH Laboratories, Palo Alto, CA); Difco (Difco 
Laboratories, Detroit, Ml); GIBCO BRL or Gibco BRL (Life Technologies, Inc., Galthersburg, 
MD); Invitrogen (Invitrogen Corp., San Diego, CA); NEB (New England Biolabs, Beverly, 
MA); Sigma (Sigma Chemical Co., St. Louis, MO); Takara (Takara Bio Inc. Otsu, Japan); 
Roche Diagnostics (Roche Diagnostics, a division of F. Hoffmann La Roche, Ltd., Basel, 
Switzerland); EM Science (EM Science, Gibbstown, NJ); Qiagen (Qiagen, Inc., Valencia, 
CA); and Stratagene (Stratagene Cloning Systems, La Jolla, CA). 

In the Examples that follow, the cultures, media, and methods described below were 

used. 
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Plasmids, Bacterial Strains and Media 

Table 1 lists the plasmids and bacterial strains used during the development of the 
present invention. TY medium (tryptone/yeast extract) contained Bacto tryptone (1%), Bacto 
yeast extract (0.5%) and NaCI (1%). Extracellular proteolytic activity was tested on TY agar 
(1 .5% agar) with skim milk added (1 %). When required, media for fi. subtilis were 
supplemented with kanamycin (Km; 5 jig/ml), spectinomycin (Sp; 100 pg/ml), or 
erythromycin (1 ng/ml). 

Table I. Bacterial Strains 



Strains Relevant Properties 

B. subtilis ~— — 

168* trpC2 

AL like 168::pAL; aprE-\acZ\ Errf 

degU32 (Hy)** like 1 68; degU32 (Hy); Krrf 

ArapE like 168::pUCE5-Sp; rapE-Sp; Sp r 

ArapE degU32 (Hy) like 1 68::pUCE5-Sp; rapE-Sp; degU32 (Hy); Krrf Sp r 
ArapE AL like 1 68::pUCE5-Sp; rapE-Sp aprE-lacZ; Errf; Sp r 

ArapE AL degU32 (Hy) like 168::pUCE5-Sp; rapE-Sp aprE-lacZ; Errf; Sf/degU32(Hy)\ 

Krrf 

ArapA like 168:: pUKA-Cm; rapA-Cm, Crrf 

ArapA AL like 168::pUKA-Cm; rap/l-Cm aprE-lacZ\ Errf; Crrf 

ArapA AL deg(J32 (Hy) like 168::pUKA-Cm; rapA-Cm aprE-lacZ\ Errf; Crrf degU32 
(W,Krrf 

ArapF like 168:: pUKF-Sp; rapF-Sp; Crrf 

ArapFPL like 168::pUKF-Sp; rapF-Sp aprE-lacZ\ Errf; Crrf 

ArapF AL degU32 (Hy) like 168::pUKF-Sp; rapF-Sp aprE-iacZ\ Errf; CrrfdegU32(Hy)] 

Krrf 

XphrE like 168::pHRE; amyE::xylA-phrE t Crrf 

XphrE AL like 1 68::pHRE; amyE::xylA-phrE 9 Crrf; aprE-\acZ\ Errf 

XphrE ALdegU32{Hy) like 168::pHRE; amyE::xylA-phrE t Crrf; aprE-lacZ; Errf; 

degU32(Hy)\Kfrf 
XphrF like 1 68::pHRF; amyE::xylA-phrF, Crrf 

XphrF AL like 1 68::pHRF; amy E:\xylA-phrF, Crrf; aprE-lacZ] Errf; degU32 

(W.Krrf 

XphrFAL degU32 (Hy) like 168::pHRF; amyE::xylA-phrF 9 Crrf; aprE-lacZ; Errf 
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*See, Kunst etai (1997), supra. 

**See, Antelmann et a/., Genome Res., 1 1 :1484-502 (2001 ). 

DNA and RNA Techniques 

Procedures for DNA purification, restriction, ligation, agarose gel electrophoresis, 
and transformation of E. coli were carried out as known in the art (See e.g., Sambrook et ai, 
Molecular Cloning: A Laboratory Manual . Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY [1989]). Enzymes were from Roche Diagnostics. 8. subtilis was transformed as 
known in the art (See, Bron and Venema, Mutat. Res., 15:1-10 [1972]). For a high reliability, 
Pwo polymerase (Roche Diagnostics BmgH) was used. To construct 8. subtilis ArapE t a 
fragment comprising the rapE gene was amplified with the primers rapEb (5'-CCA TCG ATG 
GAT CCC ACA TCA GCTGAA GTG GGA ATG-3') (SEQ ID NO:27) and rapEb (5'-CGG 
GAT CCA TCGATG GTT GCC TCC TAA CAA TGC CAG C -3') (SEQ ID NO:28). The 
amplified fragment was cleaved with SamHI, and ligated into the corresponding sites of 
pUC7, resulting in pUCE5. Next, the rapE gene of the latter plasmid was interrupted by a 
spectinomycin resistance marker, resulting in the plasmid pUCE5-Sp. Next, the 
chromosomal rapE gene of 8. subtilis was disrupted by replacement recombination of 
pUCE5-Sp with the rapE gene of 8. subtilis 168. 

To construct 8. subtilis ArapA, a fragment comprising the rapA gene was amplified with 
the primers rapA-1 (S'-AGA ATT CAA GCT TAG AAG ACA TCG AAG GG-3') (SEQ ID 
NO:29) and phrAA (5'-ATC GGA TCC TTA TGT TTG ATT GCG TGC CGC-3') (SEQ ID 
NO:30). The amplified fragment was cleaved with Hind\l\ and BamH\, and ligated into the 
HindWl and 8g/ll sites of pUK21 , resulting in pUKA. Next, the rapA gene of the latter plasmid 
was interrupted by a chloramphenicol marker, resulting in the plasmid pUKA-Cm. Next, the 
chromosomal rapA gene of 8. subtilis was disrupted by replacement recombination of 
pUKA-Cm with the rapA gene of 8. subtilis 168. 

To construct 8. subtilis A/apF, a fragment comprising the rapF gene was amplified with 
the primers rapF-1 (5'-AGT CTC TAG ATA TGA CAT ATA TAG AAT TAT GGG AGG G-3') 
(SEQ ID NO:31 ) and rapF-2 (5'-ATG CGG ATC CGT CGC ACT CGA CGT TTC GAA CTC 
CG) (SEQ ID NO:32). The amplified fragment was cleaved with Xba\ and BamHl, and 
ligated into the Xba\ and Bg/ll sites of pUK21, resulting in pUKF. Next, a 96 bp internal 
HindUl fragment from the rapF gene of the latter plasmid was replaced by a spectinomycin 
marker, resulting in the plasmid pUKF-Sp. Next, the chromosomal rapF gene of 8. subtilis 
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was disrupted by replacement recombination of pUKF-Cm with the rapF gene of B. subtilis 
168. 

To construct B. subtilis XphrE, a fragment comprising the phrE gene was amplified with 
the primers 5'E-Xbal (5-TAT ATC TAG ATC GAA AGG GGA ATG CAT GTA TGA AAT C- 
3') (SEQ ID NO:33) and 3'E-BamHI (5'- TTT AGG ATC CAG CGC TTA AAC AAG GAA TTC 
ATG AGT AGG TGC G-3') (SEQ ID NO:34). The amplified fragment was cleaved with Xba\ 
and BamHI, and ligated into the Spel and BamHI sites of pX (See, Kim ef a/., Gene 181:71- 
76 [1996]), resulting in pHRE. Next, the chromosomal amyE gene of B. subtilis 168 was 
disrupted by replacement recombination of pHRE, containing a xylose-inducible phrE gene. 

To construct B. subtilis XphrF, a fragment comprising the phrF gene was amplified with 
the primers 5'F-Xbal (5 - TTA ATC TAG AAG GAG GAG TGA GTT TGT ATG AAA TTG-3') 
(SEQ ID NO:35) and ZT-BamH\ (5'- AAT TGG ATC CTT AAG TTA AAT CAT TCC TCG 
TTG TGC AAC TTC-3') (SEQ ID NO:36). The amplified fragment was cleaved with Xba\ and 
BamHI, and ligated into the Spel and BamHI sites of pX (See, Kim etaL, supra), resulting in 
pHRF. Next, the chromosomal amyE gene of B. subtilis 168 was disrupted by replacement 
recombination of pHRF, containing a xylose-inducible phrF gene. 

In some experiments, the degU32(Hy) mutation or the transcriptional aprEAacZ fusion, or 
both, were introduced in B. subtilis ArapE, ArapA, ArapF, XphrE or XphrF by transformation 
with chromosomal DNAs of the B. subtilis strains 168 degU32(Hy) and/or 168 AL (Table I), 
using methods known in the art. Constructed strains were selected on plates containing the 
appropriate antibiotic and checked by PCR analysis for correct integration of plasmids into the 
chromosome. 

B-Galactosidase Assays 

Overnight cultures were diluted to an optical density (ODeoo) of 0.1 in fresh TY 
medium, supplemented with Phr60 pentapeptide, as appropriate. Samples for ODeoo 
readings and p-galactosidase activity determinations were taken at hourly intervals. The 
assay and the calculation of p-galactosidase activity (expressed as units per OD 6 oo) of the 
samples was performed as known in the art (See, Miller, Experiments in Molecular Genetics , 
Cold Spring Harbor Laboratory Press, NY, [1982]). 
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Protease Activity Assays 

Medium of overnight cultures grown in TY medium was diluted ten-fold in buffer 
containing 10 mM Tris [pH 6.8]; 100 mM NaCI and 1.4 mg/ml azo dye Impregnated collagen 
(Sigma), and incubated for one hour at 37°C. Next, the release of the dye was measured by 
optical density readings at 520 nM as a quantification of the proteolytic activity. 



EXAMPLE 1 

Chromosomally-Encoded Phr Peptides are Involved in the Regulation of aprE 
To analyze whether chromosomally-encoded Rap-Phr systems might be involved in 
protease production, 6. subtilis AL (bearing an aprE-lacZ transcriptional fusion), was grown 
in TY medium supplemented with synthetic PhrA, -C, -E, -F, -G, -H, -I, or-K pentapeptides 
(Table II). As shown by p-galactosidase assays performed with cells harvested in the post- 
exponential growth phase (See, Figure 1), PhrE and PhrF had a positive effect on aprE 
transcription (±135 % relative to wild-type); PhrG, PhrA, PhrK and Phrl had a weak positive 
effect on aprE transcription (±100-120 % relative to wild-type); whereas PhrC had a clear 
negative effect on aprE transcription (± 60 % relative to wild-type). Finally, the growth rates, 
and cell densities were not significantly affected by the presence of Phr peptides in the 
culture medium (data not shown). These results indicate that the temporal transcription of 
aprE is regulated by the combined action of several Rap-Phr systems. 

Table II. Pre-Phr proteins and their active pentapeptides. Hydrophobic domains of the putative 
signal peptides are indicated in grey shading, predicted signal peptidase recognition sequences 
are in italics. Putative processing sites are indicated with an arrow. The sequences of synthetic 
Phr pentapeptides used in this study are underlined and in bold. 



PhrA 


MKSKWMSGLLLVAVGFS FTQVM VHA I GETANTEGKTFHIA 4 ARNQT (SEQ ID NO: 4) 


PhrC 


MKLKSKLFVICLAAAAI FTAAGVS ANA i EALDFHVT I ERGMT (SEQ ID NO: 10) 


PhrE 


MKSKJLFISLSAVLIGLAFFGSMYNGEMKEA i SRNVT i LAPTHEFLV 

(SEQ ID NO: 14) 


PhrF 


MKIiKSKJLLLS CLALSTV FVAT TIA I NAPTHQIEVA 1 QRGMI (SEQ ID NO: 18) 


PhrG 


MKRFLIGAGVAAVI LSGWFJA I DHQTHSQEMKVA ± EKMIG (SEQ ID NO: 38) 


Phrl 


MKI SRJ LLAAVI LS SVFS IT YLQS DHNTE IKVAA I DRVGA (SEQ ID NO: 22) 


PhrK 


MK^VLCVSILAVILSGVA I LTQLSTDSPSNIQVA ir ERPVG i G (SEQ ID NO: 37) 
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EXAMPLE 2 

Transcription of aprE is Controlled by the RapE-PhrE System 

In this Example, experiments conducted to further assess the effects of Rap-Phr 
systems on the transcription of aprE are described. The RapE-PhrE system was used as a 
model for two reasons. First, this pair of proteins shows the highest level of homology with 
the plasmid-encoded Rap60-Phr60 system (data not shown), which was shown during the 
development of the present invention to be involved in aprE transcription. Second, of all Phr 
pentapeptides tested during the development of the present invention, PhrE was shown to 
have the strongest positive effect on aprE transcription (See, Figure 1). To determine the 
effect of PhrE on aprE transcription during growth, synthetic PhrE was added at two 
concentrations (1 and 10 pg/ml) to the medium of 8. subtilis AL The results showed that 
aprE transcription was improved by the addition of PhrE only during the post-exponential 
growth phase in a PhrE concentration-dependent manner (See, Figure 2). Complementary 
to this result was the observation that aprE transcription was increased in a ArapE genetic 
background (See, Figure 3). It was also noted that in the latter case, aprE transcription 
seemed to be improved during all growth phases. Taken together, these results indicate 
that RapE is an inhibitor of aprE transcription and that RapE itself can be inhibited by the 
PhrE pentapeptide. 

EXAMPLE 3 

Improved Protease Secretion in a rapE degU32(Hy) Chromosomal Background 

A well-known activator of aprE transcription is DegU, which is phosphorylated via 
DegS (Msadek et a/., J. Bacterid., 172:824-834 [1990]; and Kunst et a/., [1994], supra). A 
degU32 (Hy) mutation results in stable Degll~P, causing hyper transcription of many 
protease genes, such as aprE (Olmos et a/., Mol. Gen. Genet., 253: 562-567 [1997]). The 
latter mutation, in combination with the aprE promoter, finds use in achieving high 
production yields in strains used for industrial fermentations. To determine whether 
disruption oirapE could improve protease secretion in a strain with a degU32 (Hy) mutation, 
a plate assay was used. As shown in Figure 4, Panel A, the halo around colonies of cells 
containing both mutations was significantly larger than that around colonies of cells 
containing only the degU32 (Hy) mutation. Similarly, the proteolytic activity in the medium of 
overnight cultures of B. subtilis 168 ArapE degU32 (Hy) was about 1 35% of that in B. subtilis 
168 degU32 (Hy) (See, Figure 4, Panel B). Thus, these results show that protease secretion 
can be improved by disruption of the rapE gene. 
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EX AMPLE 4 

Overproduction of Pre-PhrE and Pre-PhrF Can Increase the Transcription of aprE 

To analyze whether overproduction of PhrE and PhrF can increase aprE 
transcription, B. subtilis AL (bearing an aprE-lacZ transcriptional fusion) B. subtilis XphrE AL 
(xylose-inducible production of pre-PhrE; See, Figure 5, Panel A) and B. subtilis XphrFAL 
(xylose-inducible production of pre-PhrF; See, Figure 5, Panel A), were grown in TY medium 
supplemented with 1% xylose. As shown by the results of p-galactosidase assays (See, 
Figure 5, Panel B), production of pre-PhrE and pre-PhrF had a positive effect on aprE 
transcription during post-exponential growth (±135-180% relative to wild-type). Similar 
results were obtained with the same strains carrying in addition the degU32(Hy) mutation for 
hypertranscription of aprE (See, Figure 5, Panel C). However, the growth rates, and cell 
densities were not significantly affected by the induction of the phrE/F genes by xylose (data 
not shown). These results show that overproduction of pre-PhrE and pre-PhrF can improve 
aprE transcription, most likely by inhibiting RapA and RapF, respectively. However, it is not 
intended that the present invention be limited to any specific mechanism. 

EXAMPLE 5 
Transcription of aprE is Controlled by RapA 

To analyze the effects of RapA on the transcription of aprE, the rapA gene of the 
strain B. subtilis AL (bearing an aprE-lacZ transcriptional fusion) was disrupted by a 
chloramphenicol marker (See, Figure 6, Panel A). Next, B. subtilis AL and B. subtilis ArapA 
AL were grown at 37°C in TY medium. As shown by p-galactosidase assays (See, Figure 6, 
Panel B), disruption of rapA had a positive effect on aprE transcription during post- 
exponential growth (±120-200% relative to wild-type). Similar results were obtained with the 
same strains carrying in addition the degU32(Hy) mutation for hypertranscription of aprE 
(See, Figure 36, Panel C). However, the growth rates, and cell densities were not 
significantly affected by the rapA disruption (data not shown). Taken together, these results 
indicate that RapA is an inhibitor of aprE transcription, most likely by the dephosphorylation 
of SpoOF~P, similar to the function of RapE (See Figure 8, as well as Perego, [1998], supra\ 
and Tzeng et a/., [1998], supra). 
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EXAMPLE 6 
Transcription of aprE is Controlled by RapF 

To analyze the effects of RapF on the transcription of aprE, the rapF gene of the 
strain B. subtilis AL (bearing an aprE-lacZ transcriptional fusion) was disrupted by a 
spectinomycin marker (See, Figure 7, Panel A). Next, B. subtilis AL and B. subtilis ArapF AL 
were grown at 37°C in TY medium. As shown by p-galactosidase assay results (Figure 7, 
Panel B), disruption of rapF had a positive effect on aprE transcription during post- 
exponential growth (±1 10-150% relative to wild-type). Similar results were obtained with the 
same strains carrying in addition the degil32(Hy) mutation for hypertranscription of aprE 
(See, Figure 7, Panel C). However, the growth rates, and cell densities were not 
significantly affected by the rapF disruption (data not shown). Taken together, these results 
indicate that RapF is an inhibitor of aprE transcription, but the target of RapF is currently 
unknown. Indeed, it is not intended that the present invention be limited to any particular 
mechanism(s). 

All publications and patents mentioned in the above specification are herein 
incorporated by reference. Various modifications and variations of the described method 
and system of the invention will be apparent to those skilled in the art without departing from 
the scope and spirit of the invention. Although the invention has been described in 
connection with specific preferred embodiments, it should be understood that the invention 
as should not be unduly limited to such specific embodiments. Indeed, various 
modifications of the described modes for carrying out the invention that are obvious to those 
skilled in the art and/or related fields are intended to be within the scope of the present 
invention. 
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Claims 

1 . A method of altering the amount of a polypeptide of interest expressed from a 
cell comprising deletion of one or more rap genes selected from the group consisting of 
rapA t rapE, rapF, rapG, rap/, and rapK. 

2. The method of Claim 1 , wherein said polypeptide of interest is a homologous 
polypeptide. 

3. The method of Claim 2, wherein said homologous polypeptide is a protease. 

4. The method of Claim 3, wherein said homologous polypeptide is AprE. 

5. The method of Claim 1 , wherein said polypeptide of interest is a heterologous 
polypeptide. 

6. The method of Claim 5 f wherein said heterologous polypeptide is selected 
from the group consisting of hydrolases, isomerases, transferases, kinases, phosphatases, 
hormones, growth factors, cytokines, and antibodies. 

7. The method of Claim 6, wherein said hydrolase is selected from the group 
consisting of proteases, cellulases, amylases, and lipases. 

8. The method of Claim 6, wherein said isomerase is selected from the group 
consisting of racemases, epimerases, tautomerases, and mutases. 



9. A method of altering the amount of a polypeptide of interest expressed from a 
cell comprising inactivation of one or more rap genes selected from the group consisting of 
rapA t rapE, rapF, rapG, rapl, and rapK. 
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10. The method of Claim 9, wherein said polypeptide of interest is a homologous 
polypeptide. 

1 1 . The method of Claim 1 0 f wherein said homologous polypeptide is a protease. 



12. The method of Claim 10, wherein said homologous polypeptide is AprE. 

1 3. The method of Claim 9, wherein said polypeptide of interest is a heterologous 
polypeptide. 



14. The method of Claim 3, wherein said heterologous polypeptide is selected 
from the group consisting of hydrolases, isomerases, transferases, kinases, phosphatases, 
hormones, growth factors, cytokines, and antibodies. 

15. The method of Claim 14, wherein said hydrolase is selected from the group 
consisting of proteases, cellulases, amylases, and lipases. 

16. The method of Claim 14, wherein said isomerase is selected from the group 
consisting of racemases, epimerases, tautomerases, and mutases. 



17. A method of altering the amount of a polypeptide of interest expressed from a 
cell comprising overexpression of one or more phr genes selected from the group consisting 
oiphrA, phrE, phrF, phrG, phrl, and phrK. 

18. The method of Claim 17, wherein said polypeptide of interest is a 
homologous polypeptide. 



19. The method of Claim 18, wherein said homologous polypeptide is a protease. 



20. The method of Claim 18, wherein said homologous polypeptide is AprE. 
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21 . The method of Claim 1 7, wherein said polypeptide of interest is a 
heterologous polypeptide. 

22. The method of Claim 21 , wherein said heterologous polypeptide is selected 
from the group consisting of hydrolases, isomerases, transferases, kinases, phosphatases, 
hormones, growth factors, cytokines, and antibodies. 

23. The method of Claim 22, wherein said hydrolase is selected from the group 
consisting of proteases, cellulases, amylases, and lipases. 

24. The method of Claim 22, wherein the isomerase is selected from the group 
consisting of racemases, epimerases, tautomerases, and mutases. 

25. A composition comprising one or more of the Phr proteins selected from the 
group consisting of PhrA, PhrE, PhrF, PhrG, Phrl, and PhrK. 

26. A method of altering the amount of a polypeptide of interest expressed from a 
cell comprising contacting a host cell with the composition of Claim 25. 

27. The method of Claim 26, wherein said polypeptide of interest is a 
homologous polypeptide. 

28. The method of Claim 27, wherein said homologous polypeptide is a protease. 

29. The method of Claim 27, wherein said homologous polypeptide is AprE. 

30. The method of Claim 26, wherein said polypeptide of interest is a 
heterologous polypeptide. 
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31. The method of Claim 30, wherein said heterologous polypeptide is selected 
from the group consisting of hydrolases, isomerases, transferases, kinases, phosphatases, 
hormones, growth factors, cytokines, and antibodies. 

32. The method of Claim 31 , wherein said hydrolase is selected from the group 
consisting of proteases, cellulases, amylases, and lipases. 

33. The method of Claim 31 , wherein said isomerase is selected from the group 
consisting of racemases, epimerases, tautomerases, and mutases. 

34. A vector comprising at least one nucleotide sequence from the group 
consisting of phr encoding nucleotides of SEQ ID NOs:3, 13, 17, 21, 25, and 41. 

35. The vector of Claim 34, further comprising a selection marker gene. 

36. The vector of Claim 35, further comprising a nucleotide sequence encoding a 
polypeptide of interest. 

37. An expression cassette comprising a first DNA sequence encoding at least 
one phr gene selected from the group consisting of phrA % phrE, phrF t phrG, phrl, and phrK. 

38. The expression cassette of Claim 37, further comprising a second DNA 
sequence encoding a protein of interest that can be expressed by the host cell. 

39. The expression cassette of Claim 38, further comprising a third DNA 
sequence encoding a selection marker. 
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40. A Bacillus host cell having been transformed with at least one nucleic acid 
sequence encoding at least one phr gene selected from the group consisting of phrA, phrE t 
phrF, phrG, phrl, and phrK, 

41 . The Bacillus host cell of Claim 40, further comprising a gene encoding a 
polypeptide of interest 

42. A method for producing a protein of interest comprising the steps of: 

(a) culturing under suitable conditions a Bacillus host cell comprising: 

i) a DNA encoding the protein of interest, wherein said 
host cell is 

ii) transformed with a DNA encoding a protein having the 
amino acid sequence selected from the group consisting of SEQ ID 
Nos:3, 13, 17, 21, 25, and 42; or a protein having an amino acid 
sequence at least 85% identical to a protein having the amino acid 
sequence selected from the group consisting of SEQ ID NOs: 3, 13, 
17, 21, 25, and 42, wherein said protein has a Phr function; and 

(b) allowing expression of the protein of interest. 

43. A method for producing a protein of interest comprising the steps of: 

(a) culturing under suitable conditions a Bacillus host cell comprising: a DNA 
encoding the protein of interest, wherein said host cell lacks a functional rap gene or 
gene product; and 

(b) allowing expression of the protein of interest 

44. The method of Claim 43, where in said rap gene is selected from the group 
consisting of rapA, rapB, rapE, rapF, rapG, rapt, and rapK. 

45. A method for production of a protein of interest in a host cell, comprising: 

(a) transforming said host cells with a first nucleotide sequence encoding at 
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least one at least one phr gene selected from the group consisting ofphrA, phrE, phrF t 
phrG, phrl t and phrK\ 

(b) transforming said host cells with a second nucleotide sequence encoding 
a protein of interest that can be expressed by the host cell; and 

(c) culturing said host cells under conditions to express said first and second 
DNA sequences. 

46. The method of Claim 45, further comprising the step of recovering said 
polypeptide from said culture medium. 

47. A method for production of a protein of interest in a host cell, comprising 
culturing bacterial cells that have been transformed with a recombinant expression vector 
comprising a first DNA sequence encoding a polypeptide that can be expressed by the 
bacteria and a second DNA sequence encoding at least one phr gene selected from the 
group consisting of phrA, phrE, phrF, phrG, phrl, and pftrK.under conditions to express said 
first and second DNA sequences. 

48. The method of Claim 48, further comprising the step of recovering said 
protein from said culture medium. 

49. A method for production of a protein of interest in a host cell, comprising: 

(a) transforming a host cell with a DNA encoding a polypeptide of interest; 

(b) contacting said host cell with a Phr pentapeptide selected from the group 
consisting of PhrA, PhrE, PhrF, PhrG, Phrl, and PhrK; and 

(c) culturing said host cells under conditions to express said polypeptide of 
interest. 
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Figure 3 
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Figure 4 
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Figure 6 
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rap A DNA 

TTGAGGATGA AGCAGACGAT 
TGAATGGTAT ACTCATATCC 
TCAAGCTCGA AGTAGAAAGA 
TTGCTGCTGT ATTATTCTTT 
TTACATTAAG CCTTTTGGAG 
TGTTAGAAGA CATCGAAGGG 
TATTACTTTA ATTTTTTTCG 
TGTCAGTGCC ATGATGTATT 
TCTCGGATGA TATTGAGAAA 
TTTTACAATT TAAAACAAAC 
ATTAGAAACA TACCAAATGT 
GTGAATTCGT TATTGCAGGT 
GCATTGCCCC ACTTAGAACT 
TCCCCGCCTG ATCAGTTCTG 
AAATGGGTGA ACTGCAAAAG 
ATTTGCAAGT CGGAAAAGTT 
AACACAAGTT CTGTATAAAC 
ATCGTGAAGG ATTGGAAATC 
GAGCTTTTTC AATTTTTACA 
ATCAGTCTCA CACACCTTTC 
ATATTGAAGA GCTGGCGCAT 
CAGCCCGAAA AAGCACTTTC 
ACAAATCCAG AGAGGAGATT 



TCCGTCCTCT TATGTCGGGC 
GGCAGTTCCA CGTCGCTGAA 
GAAATTGAGG ATATGGAAGA 
AATGGAGTTC AGGCACCGTG 
AGGACACGTC GCAGCTAGAG 
AATCAGTACA AGCTGACAGG 
AGGAATGTAT GAATTTAAGC 
ATAAACGGGC AGAAAAGAAT 
GCAGAGTTTG CTTTTAAAAT 
CTATGTTTCG ATGAGCTACG 
ATGAAACGTA CACCGTCCGC 
AATTATGATG ATATGCAGTA 
GGCTTTAGAT CTTGCAAAGA 
CCCTATATAA TCTCGGAAAC 
GCAGCCGAAT ACTTTGGGAA 
CGATAATCTT CCGCATTCTA 
AAAAAAATGA CGCCGAAGCG 
GCCCGTCAAT ACAGTGATGA 
TGCGTTATAC GGAAAAAACA 
AATTTCTTGA AGAACATATG 
GATGCTGCCC AATTCTATAT 
ATTTTATGAG AAAATGGTGC 
GTTTATATGA AATCTAA 

Figure 12 



TTAAAATTAA 50 

GCCGAACGGG 100 

AGACCAAGAT 150 

TCATGCTGGA 200 

TTTTCAGAAT 250 

GCTTCTCGAA 300 

AGAAGATGTT 350 

CTTGCCCTCG 400 

GGCTGAGATT 450 

CCGTTCAGGC 500 

AGAATCCAAT 550 

TCCAGAAAGA 600 

AAGAAGGCAA 650 

TGCTATGAGA 700 

ATCTGTTTCT 750 

TCTACTCTTT 800 

CAAAAAAAGT 850 

ATTATTTGTG 900 

TTGACACAGA 950 

CTGTATCCTT 1000 

AGAAAACGGA 1050 

ACGCACAAAA 1100 
1137 



RapA protein 

MRMKQTIPSS YVGLKINEWY 
LLLYYSLMEF RHRVMLDYIK 
YYFNFFRGMY EFKQKMFVSA 
FYNLKQTYVS MSYAVQALET 
ALPHLELALD LAKKEGNPRL 
ICKSEKFDNL PHSIYSLTQV 
ELFQFLHALY GKNIDTESVS 
QPEKALSFYE KMVHAQKQIQ 



THIRQFHVAE AERVKLEVER 
PFGEDTSQLE FSELLEDIEG 
MMYYKRAEKN LALVSDDIEK 
YQMYETYTVR RIQCEFVTAG 
ISSALYNLGN CYEKMGELQK 
LYKQKNDAEA QKKYREGLEI 
HTFQFLEEHM LYPYIEELAH 
RGDCLYEI 

Figure 13 



EIEDMEEDQD 


50 


NQYKLTGLLE 


100 


AEFAFKMAEI 


150 


NYDDMQYPER 


200 


AAEYFGKSVS 


250 


ARQYSDELFV 


300 


DAAQFYIENG 


350 




378 
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phrA DMA 

ATGAAATCTA AATGGATGTC AGGTTTGTTG CTCGTTGCGG TCGGGTTCAG 50 

CTTTACTCAG GTGATGGTTC ATGCAGGTGA AACAGCAAAC ACAGAAGGGA 100 

AAACATTTCA TATTGCGGCA CGCAATCAAA CATGA 135 



Figure 14 



PhrA Protein 

MKSKWMSGLL LVAVGFSFTQ VMVHAGETAN TEGKTFHIAA RNQT 44 



Figure 15 



rapB DNA 

ATGGCCGCGT ACGAGATCCC GTCATCTCAA GTAGGAGTGA AAATCAATAA 50 

GTGGTATAAG CACATTTTAG CATTTCAAGT GGCTGATGCC GTAAAACTGA 100 

AAGAAGAAAT TGATCTAGAT ATCGAACAGA TGGAAGAGGA TCAGCTGCTT 150 

CTCCTGTATT ATCAGCTGAT CAGCTACCGG CACCAAATTA TGCTGGATTA 200 

TGTAAAGCCT GACCTGCACG AGGAATCTCA GCTTCAATAT CGTGAATTGA 250 

TCAAAACGCT GGAAAGCAAT CAGGACAGCA TCTCCGGATT ATCTGAGTAT 300 

TATTTTCACT TGTTTAGAGG AATGTACGAA TTTGAACAGA ACAATTACAT 350 

CTCGGCTATA TCCTTTTATC GGAAAGCTGA GAAGATGCTT GCGTTTGTAG 400 

AAGATGAAAT TGAAAGAGCG GAATTTCATT TTAAAGTGGC GGAAGTCTTT 450 

TATATTATGA AGCAGACGCA CTTTTCCATG AACCATGCCG TACAGGCGCT 500 

GGAGACGTAT AAGGCTCACG ATTTTTACAG AGTCAGAAGA ATTCAATGCC 550 

ATTTCGTCAT ATCAGGAAAC TACATTGACT ACAGAAATTA TGAAAAAGCG 600 

CTTGAACATT TAGATGACGC GTATCGCCTT GCTTTATTGG AAGGACAGCC 650 

CCGTTTGATC GGATCAGCCC TTTATAATAT TGGTAATTGC TATGATGACA 700 
AGGGGGAGCT GGATCAGGCC GCCGAATACT TTGAAAAAGC GCTTCCTGTC * " 750 

TTCGAGGATT ATCAGCTGGA ACAGCTTCCA AAAGCGCTCT TCAGCCTGAC 800 

TCGCGTTCTT TTCAAAAAAC AAGATAGCGA AGCCGCGATA CGGTACTATG 850 

AAAAAGGGAT TGCCATTGCC CAAAAACGAA ACGACTTCTT CAGTCTGGCA 900 

AAATATAAAT TTCTGCAGGC ATTATATGTG GAATCTGTCA ATTTGAATAT 950 

GATTCAAGAA GTATTTGATT ATATGGAAGA AAAGGGCCTG TATGTATATA 1000 

TTGAAGAATT TGCTTTAGAT GCTGCCTCTT ATTTCAGTCA TCGCGAACAA 1050 

TATAAAGAAG CGGTGTATTT TTATGAAAAA GCGGTCAGCA TGAGAGAGAT 1100 

GATTCAAAGG AACGATTGTT TATATGAAGT ATGA 1134 



Figure 16 
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RapB Protein 

MAAYEIPSSQ VGVKINKWYK HILAFQVADA VKLKEEIDLD IEQMEEDQLL 
LLYYQLISYR HQIMLDYVKP DLHEESQLQY RELIKTLESN QDSISGLSEY 
YFHLFRGMYE FEQNNYISAI SFYRKAEKML AFVEDEIERA EFHFKVAEVF 
YIMKQTHFSM NHAVQALETY KAHDFYRVRR IQCHFVISGN YIDYRNYEKA 
LEHLDDAYRL ALLEGQPRLI GSALYNIGNC YDDKGELDQA AEYFEKALPV 
FEDYQLEQLP KALFSLTRVL FKKQDSEAAI RYYEKGIAIA QKRNDFFSLA 
KYKFLQALYV ESVNLNMIQE VFDYMEEKGL YVYIEEFALD AASYFSHREQ 
YKEAVYFYEK AVSMREMIQR NDCLYEV 
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rapC DNA 

ATGAAGAGTG 
GTGGTACAGA 
AAGCTGAAAT 
CTGCTTTATT 
TCTTGAGCCG 
AATTATCAAG 
GACTATTACG 
ATTTATTTCT 
TTGTCGCAGA 
GCTTATTATT 
CGCCTATGAA 
AGTGCCATTT 
CAAGCCGCAC 
AAAAGCCCAG 
ACAATCAAGA 
TCCACATTTG 
TTTAATCACC 
AATATCACAA 
TATGCCGTAA 
TGAAGAAGGA 
ACGCTGATAT 
CAGAAATGGT 
AAGAAAACAA 



GGGTAATTCC 
TATATCCGGA 
CCAGCAGGAA 
ATTCACTAAT 
TTAGAGAAAT 
AAACATTGAC 
TGAATTTTTT 
GCCATTACAT 
CCATATTGAA 
ATATGAAGCA 
ATTTACGTGG 
CGTCTTCGGG 
GCCATTTCAA 
CTGGTTGGAA 
CCTTCTAGAC 
AAAGCAGCAG 
CTGATTTATT 
GCGGGGCTAT 
AATTCGAGTT 
ATCGAACGAT 
AGAGGATTTA 
TTAAACTGTC 
ATACAAAGGA 



TTCTTCAGCG 
CATTCAGCGT 
CTGAAACACA 
GGAATTCCGG 
TAAATATCGA 
AGCAACCAGG 
TCGCGGGATG 
ACTATAAACA 
CGGGCTGAAT 
AACGTATTTT 
AGCAGGAAAC 
GTCAACCTGA 
ATTGGCGCTC 
GAGCATACTA 
CCTGCCATTG 
GATCGTCAAT 
ATAAACAGGG 
GAATATGCTA 
TTTGCAATCC 
GTTTCCAGTA 
GCCCTAGAAG 
TGCTTCCTAC 
GTGAAGGTTT 



GTCGGTCAAA 
GCCGGATGCC 
TGCAGCATGA 
CACCAGCTTA 
AGACCAGCCA 
CAGATCTCAA 
TATGAATTTG 
GGCGGAGAAA 
TCTATTTTAA 
TCATTGATTA 
CTATAATGTG 
TGGATGAAAG 
AACATGGCCC 
CAATCTCGGG 
ATTACTTTGA 
TCTCTCCCGC 
AAAACATGAT 
AAGAAACAGA 
CTATATCTGG 
CTTAAAAAAT 
TAGCAAAATA 
TTTCTACAAG 
GTATGAAATT 



AAATTAACGA 
GAAGTGTTAA 
TTCCAACTTG 
TGCTGGATTA 
AGCCTGTCTG 
AGGGCTGCTC 
ATAAGCGGGA 
AAGCTCTCCT 
AATCGCGGAA 
ATATAAAAAA 
AGAATCATTC 
AAATTTCGAA 
AAGCAGAACA 
TTATGCTATT 
AAAAGCGGTC 
AAGCCTATTT 
AAAGCTTCGG 
CGATGCAGAC 
ATCAGCCCAA 
AAAAATATGT 
TTACTATGAA 
TTGAAGAGGC 
GAAATCTAA 
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600 
650 
700 
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850 
900 
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1100 
1149 
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RapC Protein 

MKSGVIPSSA VGQKINEWYR YIRTFSVPDA EVLKAEIQQE LKHMQHDSNL 
LLYYSLMEFR HQLMLDYLEP LEKLNIEDQP SLSELSRNID SNQADLKGLL 
DYYVNFFRGM YEFDKREFIS AITYYKQAEK KLSFVADHIE RAEFYFKIAE 
AYYYMKQTYF SLINIKNAYE IYVEQETYNV RIIQCHFVFG VNLMDERNFE 
QAARHFKLAL NMAQAEQKAQ LVGRAYYNLG LCYYNQDLLD PAIDYFEKAV 
STFESSRIVN SLPQAYFLIT LIYYKQGKHD KASEYHKRGY EYAKETDDAD 
YAVKFEFLQS LYLDQPNEEG IERCFQYLKN KNMYADIEDL ALEVAKYYYE 
QKWFKLSASY FLQVEEARKQ IQRSEGLYEI EI 
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phrC DNA 

ATGAAATTGA AATCTAAGTT GTTTGTTATT TGTTTGGCCG CAGCCGCGAT 50 

TTTTACAGCG GCTGGCGTTT CTGCTAATGC GGAAGCACTC GACTTTCATG 100 

TGACAGAAAG AGGAATGACG TAA 123 



Figure 20 



PhrC Protein 

MKLKSKLFVI CLAAAAIFTA AGVSANAEAL DFHVTERGMT 40 

Figure 21 



rapE DNA 

TTGATATCAA TCACATCAGC 
TAGGCATATT CAAAAATTCA 
AAATAGAACG TGATATAGAG 
TATTATCAAC TAATGGCCTT 
GCCATCTGAT GAAAACCGAA 
AGGGCCATAA GAAAAAATTA 
TTCCGCGGGA TGTATGAGTT 
ATATTACAAA AAAGCTGAAC 
AAAAAGCTGA GTTTTATTTT 
ATGACACACA TTTCGATGCA 
GAAACACGAG CTTTACTCTG 
CCGGTAACTA CGATGACCTT 
CAAGAAGCTT TAAAAGGTGC 
CTATGCCACT GCGTTTTTCA 
ACTTAAATAA GGCTGCTCGT 
AAAATTAACT CTGACGTACT 
TTATTTTAAA CAAGGCAAAA 
GAATCAGAAG TGCTGTAGAT 
GAAGCATTAG ACGTTCTTTA 
CAACATTTTT TCTCGTTTAG 
AGCTGGCATT GTTAGGAGGC 
GATAGTATCA TCTGTTTCAA 
GAAAGGGGAA TGCATGTATG 



TGAAGTGGGA ATGAAGATTA 
ACGTAACTGA TGCTGAAATG 
GTTATGGAAG AAGATCAAGA 
TCGCCATAAA ATTATGTTGG 
TGGAACTTTC AGAATATTTA 
GACAATATGC GGGCATATTA 
TAGAAACGGT GAATACACAA 
GCAAAATTCC TACAATCTCC 
AAACTGTCTG AAGTCTATTA 
TTACGCTGAG CTTTCATACA 
TTCGTCGTAT TCAGTGCCAT 
GAAAATCATG AAAAAGCTCT 
CGAGCTGCTT AAAAGTAAAA 
ATCTCGGCAA TTGCTATCAC 
TACATCGAAC AAGCCTTGGT 
TCCGCAAGCT TATCATGACC 
AAGGACAAGC AATGGATTGC 
TTTAAAGACG AACTATTTAT 
TATAAGAAAT GGCGATACAC 
AGAACGGCAA AGGATACCCT 
AACCTTTTCG ATTATAATGG 
GAAGATGGTA TATGCTCAAA 
AAATCTAA 



ATGAATGGCA 


50 


TTAAAAGCTG 


100 


CTTGCTTATT 


150 


AATACACGCT 


200 


AATAAAATTG 


250 


TTATAACTTC 


300 


GAGCCATTAC 


350 


GACAAGATTG 


400 


TCATATGAAA 


450 


ACATTTATAA 


500 


TTTGTCATTG . 


550 


CCCCCACCTT 


600 


ATACTCATAT 


650 


AAAATGGACA 


700 


TCAATACAGA 


750 


TAGCGTTGAT 


800 


TTCCGCAAAG 


850 


GAACTTATTT 


900 


CTAAACTTCT 


950 


TATCTGGAAG 


1000 


GAAAATAGAG 


1050 


AGCAAATATC 


1100 
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RapE Protein 

MISITSAEVG MKINEWHRHI QKFNVTDAEM LKAEIERDIE VMEEDQDLLI 50 

YYQLMAFRHK IMLEYTLPSD ENRMELSEYL NKIEGHKKKL DNMRAYYYNF 100 

FRGMYEFRNG EYTRAITYYK KAERKIPTIS DKIEKAEFYF KLSEVYYHMK 150 

MTHISMHYAE LSYNIYKKHE LYSVRRIQCH FVIAGNYDDL ENHEKALPHL 200 

QEALKGAELL KSKNTHIYAT AFFNLGNCYH KMDNLNKAAR YIEQALVQYR 250 

KINSDVLPQA YHDLALIYFK QGKKGQAMDC FRKGIRSAVD FKDELFMNLF 300 

EALDVLYIRN GDTPKLLNIF SRLENGKGYP YLEELALLGG NLFDYNGKIE 350 

DSIICFKKMV YAQKQISKGE CMYEI 375 

Figure 23 



phrE DMA 

ATGAAATCTA AATTGTTTAT CAGTTTATCC GCCGTTTTAA TTGGACTTGC 50 

CTTTTTCGGA TCTATGTATA ATGGCGAAAT GAAGGAAGCA TCCCGGAATG 100 

TAACTCTCGC ACCTACTCAT GAATTCCTTG TTTAA 135 

Figure 24 



PhrE Protein 

MKSKLFISLS AVLIGLAFFG SMYNGEMKEA SRNVTLAPTH EFLV 44 

Figure 25 



rapF DNA 

GTGACAGGTG TCATATCTTC TTCTTCCATC GGAGAAAAGA TTAACGAATG 50 

GTATATGTAC ATACGCCGAT TCAGCATACC CGATGCAGAA TATTTGCGAC 100 

GAGAAATCAA GCAAGAGCTG GATCAAATGG AAGAAGATCA AGACCTTCAT 150 

TTGTACTATT CACTGATGGA GTTTCGGCAC AACCTAATGC TTGAGTACCT 200 

TGAACCGTTA GAAAAAATGA GGATTGAGGA ACAGCCGAGA CTGTCTGATC 250 

TGCTGCTTGA GATTGATAAA AAACAGGCTC GTTTAACTGG TCTGCTTGAG 300 

TACTATTTTA ACTTCTTCAG AGGCATGTAC GAGCTGGACC AGCGGGAATA 350 

TCTGTCGGCT ATTAAATTTT TCAAAAAGGC CGAAAGCAAG CTGATATTCG 400 

TTAAGGATCG GATAGAGAAA GCTGAGTTTT TCTTTAAGAT GTCTGAATCT 450 

TATTACTATA TGAAACAAAC GTATTTTTCA ATGGACTATG CACGGCAAGC 500 

ATATGAAATA TACAAAGAAC ATGAAGCTTA TAATATAAGA TTGCTGCAGT 550 

GTCATTCTTT ATTTGCCACC AATTTTTTAG ATTTAAAACA GTATGAGGAT . 600 

GCCATCTCAC ATTTTCAAAA AGCTTATTCT ATGGCAGAAG CTGAAAAGCA 650 

GCCCCAATTA ATGGGGAGAA CTTTGTACAA TATCGGGCTT TGTAAAAACA 700 

GCCAAAGCCA ATATGAGGAT GCCATACCTT ATTTCAAAAG AGCAATAGCT 750 

GTTTTTGAAG AATCAAATAT TCTTCCTTCC TTACCTCAAG CGTATTTTTT 800 

AATTACACAG ATCCATTATA AATTAGGAAA AATAGATAAA GCTCATGAAT 850 

ATCATAGTAA GGGAATGGCT TATTCACAAA AGGCCGGAGA TGTAATATAT 900 

TTATCAGAGT TTGAATTTTT GAAATCTTTA TACTTATCAG GCCCGGATGA 950 

AGAAGCAATT CAAGGATTTT TTGATTTTCT CGAAAGTAAA ATGTTGTATG 1000 

CTGATCTTGA AGATTTCGCT ATTGATGTGG CAAAATATTA TCATGAACGT 1050 

AAAAATTTTC AAAAAGCTTC TGCTTATTTT TTGAAGGTGG AACAAGTAAG 1100 

GCAACTTATT CAAGGAGGAG TGAGTTTGTA TGAAATTGAA GTCTAA 1146 



Figure 26 
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RapF Protein 

MTGVISSSSI GEKINEWYMY IRRFSIPDAE YLRREIKQEL DQMEEDQDLH 50 

LYYSLMEFRH NLMLEYLEPL EKMRIEEQPR LSDLLLEIDK KQARLTGLLE 100 

YYFNFFRGMY ELDQREYLSA IKFFKKAESK LIFVKDRIEK AEFFFKMSES 150 

YYYMKQTYFS MDYARQAYEI YKEHEAYNIR LLQCHSLFAT NFLDLKQYED 200 

AISHFQKAYS MAEAEKQPQL MGRTLYNIGL CKNSQSQYED AIPYFKRAIA 250 

VFEESNILPS LPQAYFLITQ IHYKLGKIDK AHEYHSKGMA YSQKAGDVTY 300 

LSEFEFLKSL YLSGPDEEAI QGFFDFLESK MLYADLEDFA IDVAKYYHER 350 

KNFQKASAYF LKVEQVRQLI QGGVSLYEIE V 381 

Figure 27 



phrF DNA 

ATGAAATTGA AGTCTAAACT ATTACTCTCT TGTCTGGCTC TAAGCACTGT 50 

GTTCGTGGCA ACAACTATTG CAAATGCACC TACACACCAA ATTGAAGTTG 100 

CACAACGAGG AATGATTTAA 120 

Figure 28 



PhrF Protein 

MKLKSKLLLS CLALSTVFVA TTIANAPTHQ IEVAQRGMI 39 



Figure 29 



rapG DNA: 

ATGAATAAGATCGCACCCGCAGAAATCGCTAGCATGCTCAACGATTGGTACCTTGCCATC 
AAGAAACATGAAGTTGAAGAATCCTCCCGTTTATTTGAAGAAGTGAAGCCTTTATTGGAT 
GACATGGAAGAGGATCAGGAGGTGCTTGCCTACTTCTCCTTATTGGAACTGCGCCACAAG 
GTTTTGCTTCACGAGGCGAGAGGACAGGGCTTTCAGCATGAGGAGCCTTCTCATATGAAT 
GCTACGTCTGACATGCTGAAATATTACTTTTTTCTGTTTGAAGGCATGTATGAGGCCTAT 
AAAAATAATTATGACATTGCCATTGGGCTGTATAAAGATGCAGAGCAGTATCTCGACAAC 
ATTCCCGATCCGATTGAAAAAGCCGAATTTCACCTGAAGGTCGGTAAGCTCTATTATAAG 
CTGGGACAAAATATTGTGTCCCTCAATCATACACGGCAAGCAGTCAAAACATTCAGAGAA 
GAGACAGATTATAAAAAGT^AGCTGGCTTCAGCCCTGATTACCATGTCAGGCAATTTTACA 
GAGATGAGCCAGTTTGAAGAAGCTGAGGCTTATTTGGACGAAGCAATTCGGATCACGAGT 
GAATTAGAGGATCATTTTTTTGAAGCCCAGCTTTTGCATAACTTCGGCCTTCTACATGCG 
CAAAGCGGCAAATCAGAAGAAGCGGTTTCGAAATTAGAGGAGGCTCTACAGAACGATGAG 
TATGCCCGCTCCGCCTATTATTATCATTCTGCCTACTTGCTGATACGAGAGCTGTTTAAG 
ATCAAAAAGAAAGAACAGGCCTTATCTTATTACCAAGACGTGAAGGAAAAATTGACTGCT 
GAGCCGAATAGAATATGTGAGGCAAAAATAGACATTTTATATGCCATTTATGCAGAAGGG 
GGTCATGCGGAAACGTTTCACTTATGCAAACAACATATGGATGACTTGTTGTCCGAGAAA 
GAGTATGACAGTGTAAGAGAACTTTCCATTTTGGCTGGCGAACGGTATAGGGAACTTGAG 
CTTTACAAAGAAGCTGCCCACTTTTTTTATGAAGCATTACAGATTGAAGAACTGATTAAA 
CGAACGGAGGTTATA 



Figure 30 
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RapG Protein 

MNKIAPAEIASMLNDWYLAIKKHEVEESSRLFEEVKPLLDDMEEDQEVLAYFSLLELR^ 

VLLHEARGQGFQHEEPSHMNATSDMLKYYFFLFEGMYEAYKNITyiDIAIGLYKDAEQYLDN 

IPDPIEKAEFHLKVGKLYYKLGQNIVSLlffiTRQAVKTFREETDYKKKLASALITMSGNFT 

EMSQFEEAEAYLDEAIRITSELEDHFFEAQLLHNFGLLHAQSGKSEEAVSKLEEALQNDE 

YARSAYYYHSAYLLIRELFKIKKKEQALSYYQDVKEKLTAEPNRICEAKIDILYAIYAEG 

GHAETFHLCKQHMDDLLSEKEYDSVRELSILAGERYRELELYKEAAHFFYEALQIEELIK 

RTEVI 



Figure 31 



phrG DNA 

ATGAAAAGATTTCTGATTGGCGCAGGCGTCGCAGCGGTGATTTTATCAGGTTGGTTTATTGCGGAC 
CATCAAACCCACTCACAGGAAATGAAAGTCGCTGAGAAAATGATTGGA 

Figure 32 



PhrG Protein 

MKRFLIGAGVAAVILSGWFIADHQTHSQEMKVAEKMIG 



Figure 33 

rapX DNA 

TTGCGGGGTG TTTTCTTAGA TAAAGATAAA ATTCCGTACG ACTTAGTCAC 50 

GAAAAAGTTA AATGAATGGT ATACATCAAT AAAAAATGAT CAAGTTGAGC 100 

AAGCCGAGAT TATAAAAACA GAAGTAGAGA AAGAATTGTT AAACATGGAA 150 

GAAAATCAAG ATGCCCTGTT ATATTATCAA CTATTAGAAT TTAGACATGA 200 

GATAATGCTG AGTTATATGA AATCTAAGGA AATAGAAGAT CTCAATAATG 250 

CTTATGAGAC TATAAAAGAA ATTGAGAAGC AAGGGCAATT AACTGGCATG 300 

TTGGAATACT ATTTTTACTT TTTTAAGGGT ATGTACGAGT TTAGGCGTAA 350 

AGAATTAATT TCAGCGATAA GTGCTTATCG AATAGCTGAA TCAAAGTTGT 400 

CAGAAGTTGA GGATGAAATA GAGAAAGCAG AGTTTTTTTT CAAAGTGTCC 450 

TATGTATATT ATTATATGAA ACAAACATAC TTCTCCATGA ATTATGCAAA 500 

TCGTGCACTC AAAATATTTA GAGAGTATGA AGAATATGCT GTCCAGACTG 550 

TGCGTTGTCA ATTTATTGTA GCAGGAAACT TGATCGATTC ATTGGAATAT 600 

GAAAGAGCCT TGGAACAATT TTTGAAGTCT TTGGAAATTT CCAAGGAAAG 650 

TAACATAGAG CATTTAATTG CAATGTCACA TATGAATATT GGGATTTGTT 700 

ATGATGAATT GAAAGAATAT AAGAAGGCTT CACAACATTT AATTTTAGCG 750 

TTAGAAATTT TTGAAAAATC AAAACATAGT TTCTTAACAA AGACTTTATT 800 

CACTCTAACC TATGTAGAAG CAAAACAACA AAATTATAAT GTTGCTTTGA 850 

TATACTTTAG GAAAGGGCGA TTTATTGCCG ATAAAAGTGA TGATAAGGAA 900 

TACTCAGCGA AATTCAAAAT ATTAGAGGGA TTATTTTTTT CTGATGGTGA 950 

GACTCAATTA ATAAAGAATG CATTTTCATA TCTGGCTTCG AGAAAAATGT 1000 

TTGCTGATGT TGAAAATTTT TCGATTGAAG TCGCTGATTA TTTTCATGAA 1050 

CAAGGAAATT TAATGCTCTC TAATGAATAT TATCGTATGA GTATTGAAGC 1100 

AAGACGAAAA ATTAAAAAAG GGGAGATTAT TGATGAAAAT CAGCCGGATT 1150 

CTATTGGCAG CAGTGATTTT AAGTAG 1176 



Figure 34 
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Rapl Protein 

MRGVFLDKDK IPYDLVTKKL 
ENQDALLYYQ LLEFRHEIML 
L.EYYFYFFKG MYEFRRKELI 
YVYYYMKQTY FSMNYANRAL 
ERALEQFLKS LEISKESNIE 
LEIFEKSKHS FLTKTLFTLT 
YSAKFKILEG LFFSDGETQL 
QGNLMLSNEY YRMSIEARRK 



NEWYTSIKND QVEQAEIIKT 
SYMKSKEIED LNNAYETIKE 
SAISAYRIAE SKLSEVEDEI 
KIFREYEEYA VQTVKCQFIV 
HLIAMSHMNI GICYDELKEY 
YVEAKQQNYN VALIYFRKGR 
IKNAFSYLAS RKMFADVENF 
IKKGEIIDEN QPDSIGSSDF 

Figure 35 



EVEKELLNME 50 

IEKQGQLTGM . 100 

EKAEFFFKVS 150 

AGNLIDSLEY 200 

KKASQHLILA 250 

FIADKSDDKE 300 

SIEVADYFHE 350 

K 391 



phrl DNA 

ATGAAAATCA GCCGGATTCT ATTGGCAGCA GTGATTTTAA GTAGTGTATT 50 

TTCAATAACT TATTTGCAAA GTGATCATAA TACTGAAATT AAAGTTGCTG 100 

CAGATCGGGT AGGGGCATAG 120 



Figure 36 



PhrX Protein 

MKISRILLAA VILSSVFSIT YLQSDHNTEI KVAADRVGA 39 



Figure 37 



rapK DNA 

ATGAGTAAGA TCGCTTCTGA AGTTGTCGCT 
CATTGCTATA AAAAAACAAA AGGTTGATGA 
AGATAAAGAA ACTTTTTGAT GAAATGGAAG 
TATTATAGTC TATTAGAAGA AAGACATAAA 
AGGAGAGCCT TTACAAAAGC ACACCTATTT 
TCATAACAAA AACAAATGAT AAATTAGAAT 
GCAATGTACG AGGCATACAA CAAAAACTAT 
TGGATTAGCT GAGAAAAAGC TTGCAGAAAT 
CTGAATTTTA CTCTAAAGTC TCTTACTTAT 
ATTGTGGCAC AACATTATAT AAAAAATGCA 
CCCTGATTAT AAATGCAAAC TAGCTACATC 
ACTATGCTGA TATGAAACGA TTTGAGGAAG 
GCAATTGATA TTGCAAAAGA AACAAAAGAT 
ATTTCACAAT CTTAGTATCG TTTATTCTGA 
GCATTGAATC TCTTGAAAAA GCAATAGGAA 
ATTTATTATA TAAATTCTTT ATTCATGATG 
TGACGAAAAA ATGAAAGCCA TTAATTTTTA 
TCATATTAAT GGAGAATAAA GTATACGAAG 
AACCTTTATT GTGGGGAATT AAAAAATAAT 
TATTGAGTTT TTAAAACAGC AAAATGAACT 
CCTACATAGC TGCAAAAAGG TTTGAATCAA 
ACGAGCTTTT TCAATGCGAA AATTTGGGCT 
GGAGGGAATC TTATGA 



ACTACACTGA ATGACTGGTA 50 

ATCAATAAAA TATTATTCAG 100 

AAGATCAAGA AGTTCTTGCG 150 

ATGTTGCTGC ATTCTTCACG 200 

TACTGAAGAC AATCAAAACT 250 

ACAACTTTTA TTTATTTGAA 300 

GATCGAGCAA TTAACCTATA 350 

TCCAGATGAA ATTGAAGCAG 400 

ATACTCTTGT TAAACAAAGC 450 

ATTTCAATAT ATAAGCGACA 500 

AACAATGATT GCAGCTGCAA 550 

CAGAACAATA TTACTTAGAA 600 

GAATTTTTAA AAGCTCAATT 650 

TTGGAACAAA CCTGATAAAT 700 

ATGAATCTTG GTTACATTCG 750 

ATTAAAGAAC TCTTTAAAAT 800 

CAATAAAGCA CAGGAAAGAC 850 

CCAAAATCAG CATCCTGTAT 900 

TTCAATAATT GTATTAGTAA 950 

TGAAAGTGTA GATGAATTGT 1000 

TAGGTGCTTT TGAAGAAGCA 1050 

GAACAGAAAA TGAATCAGGT 1100 

1116 



Figure 38 
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RapK Protein 

MSKIASEWA TTLNDWYIAI KKQKVDESIK YYSEIKKLFD EMEEDQEVLA 50 

YYSLLEERHK MLLHSSRGEP LQKHTYFTED NQNFITKTND KLEYNFYLFE 100 

AMYEAYNKNY DRAINLYGLA EKKLAEIPDE IEAAEFYSKV SYLYTLVKQS 150 

IVAQHYIKNA ISIYKRHPDY KCKLATSTMI AAANYADMKR FEEAEQYYLE 200 

AIDIAKETKD EFLKAQLFHN LSIVYSDWNK PDKCIESLEK AIGNESWLHS 250 

IYYINSLFMM IKELFKIDEK MKAINFYNKA QERLILMENK VYEAKISILY 300 

NLYCGELKNN FNNCISNIEF LKQQNELESV DELSYIAAKR FESIGAFEEA 350 

TSFFNAKIWA EQKMNQVEGI h 371 

Figure 39 



phrK DNA 

ATGAAAAAAC TTGTGCTTTG CGTATCTATT TTAGCTGTGA TTTTAAGTGG 50 

AGTAGCTTTA ACGCAATTGA GTACAGATTC ACCATCTAAC ATCCAGGTAG 100 

CTGAAAGGCC AGTCGGAGGT TAA 123 



Figure 40 



PhrK protein 

MKKLVLCVSI LAVILSGVAL TQLSTDSPSN IQVAERPVGG 40 



Figure 41 



